* 
. 
. ’ ’ 
»* \e ’ 
‘ 
¢ 
, & 
4 ‘ 
‘ - 
= + 
, » 
° e a 
, ¢ 
- 
’ J ; 
. 
- 
. , 


TENTTRLGaR Ss 


WORLD SCIENTIFIC 
ed. SANKAR K. PAL 


PERSPECTIVES 
IN 
MATHEMATICAL 
SCIENCES 


2 
PURE MATHEMATICS 


Foreword 


The Indian Statistical Institute (ISI) was established on 17th December, 
1931 by a great visionary Professor Prasanta Chandra Mahalanobis to pro- 
mote research in the theory and applications of statistics as a new scien- 
tific discipline in India. In 1959, Pandit Jawaharlal Nehru, the then Prime 
Minister of India introduced the ISI Act in the parliament and designated it 
as an Institution of National Importance because of its remarkable achieve- 
ments in statistical work as well as its contribution to economic planning. 

Today, the Indian Statistical Institute occupies a prestigious position 
in the academic firmament. It has been a haven for bright and talented 
academics working in a number of disciplines. Its research faculty has done 
India proud in the arenas of Statistics, Mathematics, Economics, Computer 
Science, among others. Over seventy five years, it has grown into a massive 
banyan tree, like the institute emblem. The Institute now serves the na- 
tion as a unified and monolithic organization from different places, namely 
Kolkata, the Headquarters, Delhi, Bangalore, and Chennai, three centers, 
a network of five SQC-OR Units located at Mumbai, Pune, Baroda, 
Hyderabad and Coimbatore, and a branch (field station) at Giridih. 

The platinum jubilee celebrations of ISI have been launched by 
Honorable Prime Minister Prof. Manmohan Singh on December 24, 2006, 
and the Government of India has declared 29th June as the “Statistics Day” 
to commemorate the birthday of Professor Mahalanobis nationally. 

Professor Mahalanobis, was a great believer in interdisciplinary research, 
because he thought that this will promote the development of not only 
Statistics, but also the other natural and social sciences. To promote inter- 
disciplinary research, major strides were made in the areas of computer 
science, statistical quality control, economics, biological and social sciences, 
physical and earth sciences. 

The Institute’s motto of “unity in diversity” has been the guiding prin- 
ciple of all its activities since its inception. It highlights the unifying role 
of statistics in relation to various scientific activities. 
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In tune with this hallowed tradition, a comprehensive academic pro- 
gramme, involving Nobel Laureates, Fellows of the Royal Society, Abel 
prize winner and other dignitaries, has been implemented throughout the 
Platinum Jubilee year, highlighting the emerging areas of ongoing frontline 
research in its various scientific divisions, centers, and outlying units. It 
includes international and national-level seminars, symposia, conferences 
and workshops, as well as series of special lectures. As an outcome of these 
events, the Institute is bringing out a series of comprehensive volumes in 
different subjects under the title Statistical Science and Interdisciplinary 
Research, published by the World Scientific Press, Singapore. 

The present volume titled Perspectives in Mathematical Sciences IT: 
Pure Mathematics is the eighth one in the series. The volume consists of 
fifteen chapters, written by eminent mathematicians from different parts 
of the world. These chapters cover a wide range of topics and provide a 
current perspective of different areas of research, emphasizing the major 
challenging issues. Some of the articles are written keeping the students 
and the young researchers of mathematics in mind. I believe the state-of- 
the art studies presented in this book will be very useful to both researchers 
as well as practitioners. 

Thanks to the contributors for their excellent research contributions, 
and to the volume editors Profs. N. S. Narasimha Sastry, T. S$. S. R. K. Rao, 
M. Delampady and B. Rajeev for their sincere effort in bringing out the 
volume nicely in time. Initial design of the cover by Mr. Indranil Dutta is 
acknowledged. Sincere efforts by Prof. Dilip Saha and Dr. Barun Mukhopad- 
hyay for editorial assistance are appreciated. Thanks are also due to World 
Scientific for their initiative in publishing the series and being a part of the 


Platinum Jubilee endeavor of the Institute. 


December 2008 Sankar K. Pal 
Kolkata Series Editor and 
Director 


Preface 


Indian Statistical Institute, a premier research institute founded by Pro- 
fessor Prasanta Chandra Mahalanobis in Calcutta in 1931, celebrated its 
platinum jubilee during the year 2006-07. On this occasion, the institute 
organized several conferences and symposia in various scientific disciplines 
in which the institute has been active. 

From the beginning, research and training in probability, statistics and 
related mathematical areas including mathematical computing have been 
some of the main activities of the institute. Over the years, the contributions 
from the scientists of the institute have had a major impact on these areas. 

As a part of these celebrations, the Division of Theoretical Statistics and 
Mathematics of the institute decided to invite distinguished mathematical 
scientists to contribute articles, giving “a perspective of their discipline, 
emphasizing the current major issues”. A conference entitled “Perspectives 
in Mathematical Sciences” was also organized at the Bangalore Centre of 
the institute during February 4-8, 2008. 

The articles submitted by the speakers at the conference, along with 
the invited articles, are brought together here in two volumes (Part I and 
Part IL). 

Part I consists of articles in Probability and Statistics. Articles in Statis- 
tics are mainly on statistical inference, both frequentist and Bayesian, for 
problems of current interest. These articles also contain applications illus- 
trating the methodologies discussed. The articles on probability are based 
on different “probability models” arising in various contexts (machine learn- 
ing, quantum probability, probability measures on Lie groups, economic 
phenomena modelled on iterated random systems, “measure free martin- 
gales”, and interacting particle systems) and represent active areas of re- 
search in probability and related fields. 

Part II consists of articles in Algebraic Geometry, Algebraic Number 
Theory, Functional Analysis and Operator Theory, Scattering Theory, 
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von Neumann Algebras, Discrete Mathematics, Permutation Groups, Lie 
Theory and Super Symmetry. 

All the authors have taken care to make their exposition fairly self- 
contained. It is our hope that these articles will be valuable to researchers 
at various levels. 

The editorial committee thanks all the authors for writing the articles 
and sending them in time, the speakers at the conference for their talks and 
various scientists who have kindly refereed these articles. Thanks are also 
due to the National Board for Higher Mathematics, India, for providing 
partial support to the conference. Finally, we thank Ms. Asha Lata for her 
help in compiling these volumes. 


October 16, 2008 N. S. Narasimha Sastry 
Te Os S. Ke Ks Tae 

Mohan Delampady 

B. Rajeev 
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Chapter 1 


Use of Resultants and Approximate Roots for Doing the 
Jacobian Problem 


Shreeram S. Abhyankar 


Mathematics Department, 
Purdue University, West Lafayette, IN 47907, USA 
ram@cs.purdue.edu 


This is an expository article giving a modified version of my talks at 
ISI-Kolkata and ISI-Bangalore. After sketching the history of the jaco- 
bian problem, I shall discuss the two basic tools which are employed in 
attacking this problem. The first is the theory of resultants which are 
usually coupled with discriminants. The second is the theory of approx- 
imate roots of polynomials which are inspired by the construction of 
square roots of positive real numbers. 


1.1. Introduction 


Two given bivariate polynomials are said to form a jacobian pair if their 
jacobian equals a nonzero constant, and they are said to form an auto- 
morphic pair if the variables can be expressed as polynomials in the given 
polynomials. By the chain rule we see that every automorphic pair is a 
jacobian pair. The jacobian problem asks if conversely every jacobian pair 
is an automorphic pair. It turns out that a useful method for attacking this 
problem is to study the similarity of polynomials. Two bivariate polyno- 
mials are similar means their degree forms, i.e., highest degree terms, are 
powers of each other when they are multiplied by suitable nonzero con- 
stants. Geometrically this amounts to saying that the corresponding plane 
curves have the same points at infinity counting multiplicities. At any rate, 
the points at infinity correspond to the distinct irreducible factors of the 
degree form. 

Before getting into technicalities, I shall first give a short history of the 
problem or rather the history of my acquaintance with the problem. For 
that we have to go back to 1965 when a German mathematician, Karl Stein 
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who created Stein Manifolds, wrote me a letter asking a question. He said 
that there was an interesting 1955 paper in the Mathematische Annalen 
by Engel [9]. In this paper Engel claims to prove the jacobian theorem 
or what is now known as the jacobian problem or the jacobian conjecture 
or whatever. Karl Stein said to me that it is an interesting theorem but 
he cannot understand the proof. Can I help him? He also reduced it, or 
generalized it, to a conjecture about complex spaces. I wrote back to Stein 
giving a counterexample to his complex space conjecture. But I did not 
look at the Engel paper. Then in 1968, Max Rosenlicht of Berkeley asked 
me the same question and still I did not look at the Engel paper. Finally 
in 1970, my own guru (= venerable teacher) Oscar Zariski asked me the 
same question. Then, following the precept that one must obey one’s guru, 
I looked up the Engel paper and found it full of mistakes and gaps. 

The primary mistake in the Engel paper, which was repeated in a large 
number of published and unpublished wrong proofs of the jacobian problem 
in the last thirty-five years, is the presumed “obvious fact” that the order of 
the derivative of a univariate function is exactly one less than the order of 
the function. Being a prime characteristic person I never made this mistake. 
Indeed, the “fact” is correct only if the order of the function in nondivisible 
by the characteristic. Of course you could say that the jacobian problem is 
a characteristic zero problem, and zero does not divide anybody. But zero 
does divide zero. So the “fact” is incorrect if the order of the function is 
zero, i.e., if the value of the function is nonzero. Usually this mistake is 
well hidden inside a long argument, because you may start with a function 
which has a zero or pole at a given point and your calculation may lead to 
a function having a nonzero value at a resulting point. 

A gap is a spot where you are not sure of the argument because of 
imprecise definitions or what have you. The gap in the Engel paper seems 
to be the uncritical use of the Zeuthen-Segre invariant. For this invariant 
of algebraic surfaces see the precious 1935 book of Zariski [12]. Over the 
years I have made several attempts to understand the somewhat mysterious 
theory of this invariant, and I still continue to do so. 

In 1970-1977 I discussed the matter in my courses at Purdue and also 
in India and Japan. Mostly I was suggesting to the students to fix the 
proof and, to get them started, I proved a few small results. Notes of my 
lectures were taken down by Heinzer, van der Put, Sathaye, and Singh. 
These appeared in [1] and [4]. Then I put the matter aside for thirty years. 
Seeing that the problem has remained unsolved in spite of a continuous 
stream of wrong proofs announced practically every six months, I decided 
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to write up my old results, together with some enhancements obtained 
recently, in the form of a series of three long papers [6], [7], [8], in the Journal 
of Algebra, dedicated to the fond memory of my good friend Walter Feit. 
The ISI Jubilee Volume has given me a welcome opportunity of introducing 
these papers to the young students with an invitation to further investigate 
the problem. 

Now one of my old results says that the jacobian conjecture is equiva- 
lent to the implication that each member of a jacobian pair can have only 
one point at infinity. Another says that each member of any jacobian pair 
has at most two points at infinity. Note that the first result is a funny 
statement; it only says that to prove the jacobian conjecture, it suffices to 
show that each member of any jacobian pair has only one point at infinity. 
The second result is of a more definitive nature, and it remains true even 
when we give weights to the variables which are different from the normal 
weights. Very recently I noticed that, and this is one of the enhancements, 
the weighted two point theorem yields a very short new proof of Jung’s 
1942 automorphism theorem [10]. This automorphism theorem says that 
every automorphism of a bivariate polynomial ring is composed of a finite 
number of linear automorphisms and elementary automorphisms. In a lin- 
ear automorphsim both variables are sent to linear expressions in them. In 
an elementary automorphism, one variable is unchanged and a polynomial 
in it is added to the second variable. In his 1972 lecture notes [11], Nagata 
declared the automorphism theorem to be very profound and so it did come 
as a pleasant surprise to me that the weighted two point theorem yields a 
five line proof of the automorphism theorem. For other recent enhancements 
let me refer to my Feit memorial papers cited above. 

The present paper is only meant to whet the student’s appetite. At any 
rate the material of this paper is based on my recent talks in various places 
such as ISI-Kolkatta and ISI-Bangalore. 


1.2. Basic Technique 


Our basic technique in attacking the jacobian problem is the use of re- 
sultants and approximate roots. Resultants are usually coupled with dis- 
criminants; the theory of these two objects will be discussed in Section 3. 
Approximate roots are polynomial concepts coming out of the construction 
of square roots in the theory of real numbers; these two topics will be dis- 
cussed in Section 4. Here, assuming resultants and approximate roots, let 
us very briefly see how they are used in trying to do the jacobian problem. 
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Given any jacobian pair F(X,Y) and G(X,Y), by making a homoge- 
neous linear transformation we can arrange that they are monic polynomials 
of positive degrees N and M in Y respectively. Adjoin W to the ground 
field k and consider the algebraic closure K of k(W). Now eliminate Z by 
using the Z-resultant to get 


o(X,Y) = (-1)”** Resz(F(W, Z) — X, G(W, Z) —Y). 


Then ¢(X,7) is a monic polynomial of degree N in Y with coefficients 
in K[X]. Let pp = N and p; = M. Let dj = N and dz = GCD(p9, p1). 
Let Appp® denote the D-th approximate root of a monic polynomial ® 
whose Y-degree is a multiple E of D, i.e., Appp® is the unique monic 
polynomial © of Y-degree E/D such that degy(® — WU?) < E —(E/D). 
Let Wi(X,Y) = Y. For 2 <i <h let us inductively define 


Pi(X,Y) = Appa, O(X, Y) 
with 
pi = deg x Resy (G(X, Y), pi(X, Y)) 
and 
di+1 = GCD(po,.--, pi) 


so that dz > dg >--- > dni, = 1. Here h is called the number of charac- 
teristic pairs. 

By manipulating with this data, we can show that the polynomials F’ 
and G are similar. We can also show that if either h < 2 or h = 3 with 
d;, even then the given jacobian pair (FG) is an automorphic pair. As a 
consequence we can settle the jacobian conjecture when min(M, N) < 52. 
For details see [1] to [8]. 


1.3. Resultants and Discriminants 


The material of this Section is taken from pages 100-104 of my new Algebra 
Book [5]. Details of proof can be found on pages 172-188 of that Book. 

Beginning algebra students encounter discriminants of quadratic poly- 
nomials, but resultants are less well known. They were introduced by 
Sylvester around 1840. Actually in some sense we need to start with 
Descartes who introduced coordinates around 1637. Bézout built on these 
ideas to introduce his version of resultants for the original proof of Bézout’s 
Theorem. 
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Bézout’s Theorem, proved by him around 1770, is one of the oldest 
theorems of algebraic geometry. It says that a curve of degree m and a curve 
of degree n meet in mn points provided they have no common component 
and provided the intersections are counted properly. 

One way of proving Bézout’s Theorem is by using resultants. Vertical 
tangents can be located by using discriminants which are special cases of 
resultants. 

Assuming n,m to be nonnegative integers, the Y-Resultant of two poly- 
nomials 


F(Y) = ap¥" + a,¥" 3 +++ +a 
g(Y) = boY¥™ + bY" 1 +--+ +bm 


is the determinant 


Resy (f, g) = det(Resmaty (f, g)) 


of the n-+m by n+ ™ matrix 


ao ay . oe . An 0) ey tee ne 0 
0) ao ay:-: . . ay O« es @) 
00 ---ara St An 
Resmaty (f,g) = i oa 0 o« Q 
0 bo bi-+ + + bm O--- 0 
00 - -+ bo by : a om om oo (Dis 


where the first m rows consist of the coefficients of f and the last n rows 
consist of the coefficients of g. More precisely, the first row starts with the 
coefficients of f, these are shifted one step to the right to get the second 
row, shifted two steps to the right to get the third row, and so on for the 
first m rows, then the (m+ 1)-st row starts with the coefficients of g, these 
are shifted one step to the right to get the (m+ 2)-nd row, and so on 
for the next n rows. The matrix is completed by stuffing zeroes elsewhere. 
The determinant Resy(f,g) is sometimes called the Sylvester resultant of 
f and g because it was introduced by Sylvester in his 1840 paper where he 
enunciated the following: 
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BASIC FACT (T1). If the coefficients a;,b; belong to a domain R then we 
have: Resy(f,g) =0<@n+m #0 and either ajo = 0 = bo or f and g have 
a common root in some overfield of R. 

In case n > 0, the Y-Discriminant of f is defined to be the Y-Resultant 
of f and fy, i.e., 


Discy (f) = Resy (f, fy). 


where we view fy to be the polynomial 
fv(V) =napY"?* + (Qo —1)ayY¥"? +++ + apa 


i.e., we let the discriminant to be the determinant of the appropriate 2n — 1 
by 2n — 1 matrix without considering whether nag equals zero or not. 
From the Basic Fact (T1) we deduce the following: 


COROLLARY (T2). If n > 1 and the coefficients a; belong to a domain 
R then: Discy(f) = 0 © either a9 = 0 or f has a multiple root in some 
overfield of R. 


OBSERVATION (O1). [Resultant and Projection]. If X),...,Xy are 
indeterminates over a field k with N € Nx and R is either the polynomial 
ring k[X1,..., Xv] or the power series ring k[[X ,..., X~]], then Resy (f, g) 
equals a polynomial or power series ® = ®(X,,...,Xy). If ao and bo are in 
k* with nm 4 0 and k is algebraically closed then, in the polynomial case, 
by the Basic Fact it follows that the hypersurface ® = 0 in the N-space 
of (X1,...,Xy) is the projection of the intersection of the hypersurfaces 
f = 0 and g = 0 in the (N + 1)-space of (X1,...,Xw,Y). Moreover, 
without assuming k to be algebraically closed but assuming that ag and bo 
are nonzero constants, in the polynomial case as well as the power series 
case, by the Basic Fact it follows that: ® is identically zero (i.e., ® is the 
zero element of R) = f and g have a nonconstant common factor in R[Y]. 


OBSERVATION (02). [Discriminant and Projection]. Again if 
Xj 1,...,Xwy are indeterminates over a field k with N € N, and R is either 
the polynomial ring k[X1,..., Xj] or the power series ring k[[X1,..., Xy]], 
then Discy(f) equals a polynomial or power series A = A(X),...,Xwy). 
Now if ag is in k* with n > 1 and k is algebraically closed then, in the 
polynomial case, for all values (U;,...,Un) of (X1,..., Xn) in k, the equa- 
tion f = 0 has n roots which may or may not be distinct, and by the 
Corollary it follows that these roots are not distinct iff A(Ui,...,Un) = 0. 
In other words, when we project the hypersurface f = 0 in (N + 1)-space 
onto the N-space, above most points there lie n points, and A = 0 is the 
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locus of those points above which there lie less than n points. Moreover, 
without assuming k to be algebraically closed but assuming that ao is a 
nonzero constant, in the polynomial case as well as the power series case, 
by the Corollary it follows that: A is identically zero = f has a nonconstant 
multiple factor in R[Y]. 


OBSERVATION (03). [Isobaric Property]. View the coefficients aj, b; 
as indeterminates over Z. Give weight 7 to aj, and 7 to b;. Then 0 4 
Resy(f,g) € Zlao,...,@n,bo,.--,bm] is isobaric of weight mn, i.e., for the 
weight of any monomial a? ... ain b}° ... bi occurring in Resy (f,g) we have 
(ocren Tir) + Qoo<s<m 8Js) = mn. In particular, the principal diagonal 
ab” has weight mn, and it does not cancel out because there is no other 
term of b,,-degree n in the resultant; the principal diagonal of an N x N ma- 
trix (A;;) is the term Aj; Ago... Aww. The resultant being isobaric of weight 
mn is the fundamental fact behind various cases of Bézout’s Theorem. The 
following two Observations, where we use the set-up of Observation (O1), 
illustrate this for plane curves and general hypersurfaces respectively. 


OBSERVATION (O04). [Plane Bézout]. Let N = 1 with X = Xj, and 
assume that ag,bo are nonzero elements in k, and f,g are polynomials of 
total (X,Y )-degrees n and m respectively. By the isobaric property we see 
that then always degx ® < mn and “in general” degx ® = mn. Hence the 
n-degree plane curve f = 0 meets the m-degree plane curve g = 0 in mn 
points “counted properly.” The possibility of degx ® < mn is explained by 
saying that some intersections have “gone to infinity.” 


OBSERVATION (O05). [Hyperspatial Bézout]. Let N be general and 
assume that ao,b9 are nonzero elements in k, and f,g are polynomials 
of total (X1,...,Xw,Y)-degrees n and m respectively. By the isobaric 
property we see that then always deg:x,...x,y)® < mn and “in general” 
deg(x,,....xy)® = mn. Hence, in the (N + 1)-dimensional space, the n- 
degree hypersurface f = 0 and the m-degree hypersurface g = 0 meet along 
a “secundum” (= a subvariety of dimension two less than dimension of the 
ambient space) which projects onto the (mn)-degree hypersurface ® = 0 
in N-dimensional space. Again the possibility of deg(x, x,y) < mn says 
that some intersections have “gone to infinity.” 


EXAMPLE (X1). [Resultant and Discriminant in Terms of Roots]. 
If the coefficients a;,b; belong to a domain R and ag 4 0 ¥ bo then, upon 
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writing 
f(Y)=a0 [] (Y-ai) and g¥)=% J] (Y-4;) 
l<i<n 1<j<m 


with aj,...,Q@p and (1,..., 8m in an overfield of R, we have 


Resy (f, 9) = ag'bo I] I] (ag— Gj) 


1<i<n1<j<m 


= ag’ II g(a;) = (—1)""bG I] f(G;) 


1si<n l<jxm 


and 


Discy(f) = (-1)"""9ag (a — a,)?. 


1l<i<j<n 
EXAMPLE (X2). [Quadratic Resultant]. Considering the quadratic 
polynomials 
f(VY)=eY"7 +0¥ +e and g(VY)=e'Y? +0V +e 
and calculating the 4 x 4 determinant 


ab ¢0 
QOabe 
a’ b' c 0 
Od Ud 


we get Resy(f,g) = (a2c’? + a/c) + (b?a’c! + b/?ac) — (abb’c! + a’b'bc) — 
20cn ¢. 


EXAMPLE (X3). [Quadratic Discriminant]. Considering the quadratic 
f(VY) =aY?+bY +c 
and calculating the 3 x 3 determinant 


abe 
2a b 0 
0 2a b 


we get Discy(f) = —a(b? — 4ac). 
EXAMPLE (X4). [Cubic Discriminant]. Considering the cubic 
i) = aigY”® + oY" + aoY + a3 
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and calculating an appropriate 5 x 5 determinant we get 


Discy (f) = —ao(a?a2 — 4aga3 — 4a%a3 — 27a2a3 + 18a9a1a243). 


EXAMPLE (X5). [Special Quartic Discriminant]. Considering the 
quartic 


fWV)=Y¥*4+0Y? +oY +r 
and calculating an appropriate 7 x 7 determinant we get 


Discy (f) = 16p*r — 4p?q? — 128p?r? + 144pq?r — 27q* + 256r°. 


EXAMPLE (X6). [General Quartic Discriminant]. Considering the 
quartic 


FY ) = aoY* + aiY® + anY? +a3Y +a4 
and calculating an appropriate 7 x 7 determinant we get 


Discy (f) = ao(256aga? — 192a2?a,a3a7 — 128a2a%az + 144a2a2a2a4 
— 27a2a3 + 144a9a?.a2a74 — 6apa?aza4 — 80apa1a3a304 
+ 18a9a,a2a3 + 16agasa4 — 4agaza2 — 27ata? 


+ 18a?a2a3a4 — 4a3a3 — 4a?a3a4 + a?a%a2). 


1.4. Real Numbers and Approximate Roots 


The material of this Section is taken from pages 52-59 of my new Algebra 
Book [5]. 

Historically, the process of counting gave rise to the set N of all positive 
integers. Augmenting it by zero, this set was enlarged to get the set N of 
all nonnegative integers. Inserting the negative of everybody gave the full 
set Z of all integers. Finally the process of division gave rise to the set Q of 
all rational numbers. 

Taking this much for granted, we shall describe the limiting processes 
which gave rise first to the set R of real numbers and then to the set C of all 
complex numbers. The passage from Q to R was made precise by Cauchy 
by means of Cauchy Sequences around 1830, and by Dedekind by means 
of Dedekind Cuts around 1880. After sketching this development, we show 
how the construction of square roots of positive integers led us, around 
1975, to the construction of approximate roots of polynomials. Briefly, this 
solves the problem as to how close we can come to finding the D-the root of 
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a one-variable polynomial whose degree is a multiple of the positive integer 
Pp. 

As in the quoted Algebra Book, we shall divide the material into a 
series of Definitions and Exercises. For the basic definitions of terms and 
standard set theory symbols which we shall use, such as € for “element of,” 
the reader may consult the first few pages of the said Algebra Book, or any 
other current text-book of college mathematics. 


DEFINITION (D1). [Real and Complex Numbers]. Real numbers may 
be defined as equivalence classes of Cauchy sequences of rational numbers, 
and the complex number field C may then be defined as the splitting field 
of the quadratic polynomial Y?+1 over the real number field R. By analogy 
with N,, by Q, we denote the set of all positive rationals; moreover, by 
Qo+, Q_, and Qop_ we denote the set of all nonnegative rationals, nega- 
tive rationals, and nonpositive rationals respectively; as usual, the absolute 
value of any r € Q is denoted by |r], i-e., |r| = r or —r according as r € Qo+ 
or r € Q_; by context, this will not be confused with the size |S| of a set 9. 
A sequence & = (2; )1<i<oo in Q is Cauchy means for every e € Q4 there ex- 
ists N. € Nz such that for alli > N. and j > N,. we have |x; —2,;| < e. This 
is equivalent to the Cauchy sequence x’ = (2})1<i<oo, in symbols x ~ 2’, if 
for every €« € Q, there exists M,. € N, such that for all i > M,. we have 
|x; —24| < «. Now R may be defined to be the quotient Cg/ ~ of the set Cg 
of all Cauchy sequences in Q by the equivalence relation ~. We “identify” 
Q with a subset of R by sending any q € Q to the equivalence class of the 
Cauchy sequence (q;)1<icoo with q; = q for all 7. If y = (yi)1<i<oo is another 
Cauchy sequence in Q then the sequences (2; + Yi)i<icoo and (LiYi)1<icoco 
are Cauchy sequences whose equivalence classes are unchanged if # and y 
are replaced by equivalent Cauchy sequences; this makes R into a ring and 
in fact an overfield of Q. 


DEFINITION (D2). [Ordered Fields]. The order relation < can be ex- 
tended from Q to R by declaring that the equivalence class of x is < the 
equivalence class of y if the Cauchy sequences x and y in their equivalence 
classes can be chosen so that x; < y; for all 7; see (E3) below. Like Z and 
Q, this makes R an ordered domain, i.e., a domain whose underlying addi- 
tive abelian group is an ordered abelian group and in which the product of 
any positive elements (i.e., elements which are greater than zero) is again 
positive. Out of these Q and R are ordered fields, i.e., fields whose under- 
lying domains are ordered domains. We extend the notation Q+, Qo+, Q_, 
and Qp-— to any ordered abelian group G (and hence to ordered domains 
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and ordered fields) by putting Gi = {g € G: g > 0}, Goy = Gi U {0}, 
G_ ={ge€G:g < 0}, and Go_ = G_ U {0}; also we define the absolute 
value of any g € G by putting |g] = g or —g according as g € Goy or 
g € G_. In particular this defines the sets Ri, Roi, R-, Ro, and defines 
the absolute value |r| of any r € R. Note that now Z, = N, and Zo; =N. 
An ordered abelian group G is archimedean means for all 2,y in G; we 
have nx > y for some n € N_. Clearly R is an archimedean ordered field, 
i.e., an ordered field whose underlying additive ordered abelian group is 
archimedean. A sequence % = (2%;)1<i<oo in an ordered abelian group G 
is Cauchy means for every € € G there exists NV. € N+ such that for all 
i > N, and j > N, we have |x; — 2;| < €; the sequence x is convergent 
means it converges to a limit € € G, ie., for every € € Gy there exists 
M,. € Ny such that for all i > M, we have |€ — x;| < €; we indicate this 
by some standard notation such as 7; — € as i > o6 or lim;_.42%; = €. 
An ordered abelian group is complete means in it every Cauchy sequence 
is convergent. Clearly R is a complete field, i.e., an ordered field whose 
underlying additive group is complete as an ordered abelian group. 


DEFINITION (D3). [Torsion Subgroups and Divisible Groups]. By 
the subgroup of a group G generated by elements x71, 2%2,... in G we mean 
the smallest subgroup of G which contains these elements. The order of 
an element in a group is the order of the subgroup generated by it. The 
subgroup of an additive abelian group G generated by all of its elements of 
finite order is called the torsion subgroup of G; if this is zero then G is said 
to be torsion free. An additive abelian group G is divisible means for every 
g€Gand née Z* there is h € G with nh = g. 


EXERCISE (E1). Show that for any elements a, b in a torsion free additive 
abelian group and any nonzero integer n we have: na = nb > a = b. Show 
that any ordered abelian group is torsion free, and hence any ordered field 
is of characteristic zero. Show that for all x,y in an ordered field we have 
|xy| = |a||y|. Show that the usual order on Q is the only order on it which 
makes it an ordered field. 


EXERCISE (E2). Show that for all x,y in an ordered abelian group G we 
have |x + y| < |z|+|y|, and from this deduce that any convergent sequence 
in G is Cauchy and has a unique limit. 


DEFINITION (D4). [Rational Completions]. Given any torsion free ad- 
ditive abelian group G, let G* = G x Z* / ~ where the equivalence relation 
~ is given by: (g,n) ~ (g/,n’) = n'g = ng’, and embed G in G* by 
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identifying every g € G with the equivalence class containing (g, 1). De- 
fine addition in G* by taking equivalence classes in the proposed equation 
(g,n) + (h,m) = (mg + nh,nm). Note that then G* is a divisible additive 
abelian group such that for every g € G* we have ng € G for some n € Z*. 
We call G* the rational completion of G. Note that if G is divisible then 
G =G, 


EXERCISE (E3). In (D2) show that the induced relation < on the equiv- 
alence classes of Cauchy sequences in Q is a linear order. 


EXERCISE (E4). In (D4) show that the induced relation < on the equiv- 
alence classes of Cauchy sequences in an ordered abelian group is a linear 
order. 


EXERCISE (E5). Let G be any nonzero archimedean ordered abelian group. 
Show that given any g > 0 in G and xz > 0 in R, there exists a unique order 
monomorphism (i.e., a group monomorphism which is order preserving) 
@:G— Rsuch that ¢(g) = x, and there exists a unique order isomorphism 
(i.e., a group isomorphism which is order preserving) 7 : G* — R such that 
v(g) = «x. Moreover, for these maps we always have ¢(h) = Wh) for all 
weG. 


EXERCISE (E6). Show that 2+ x? gives a surjection R > Ro . 


HINT. The usual method of finding the decimal expansion 1.14142... of /2 
can be explained in terms of decimal expansions of integers by saying that 
<2? 14% <= 2x10? < 154, 1417 = 2% 10" = 1497, 1414 = 2x 10° < 
14157, 14142? < 2 x 10° < 141437, and so on. More generally let n > 1 
and d > 1 be any integers, and let y > 0 and 7 > 0 be any integers. Then 
clearly there is a unique integer x; such that 2? < yn™ < (a; + 1)%; 2; is 
1/dn*, Obviously 
the sequence (2;/n') is Cauchy and for its limit 2 in Ry we have 2% = y. 
Any positive rational can be written in the form y/z? where y and z are 
positive integers, and then we get 2/z € Ry with (x/z)? = y/z?. Finally, 
any 7 € Ry can be written as the limit of a sequence (7;) in Q; and then 


nothing but the n-adic expansion of the largest integer < y 


taking €; € R+ with é&¢ =; we get a Cauchy sequence (£;) for whose limit 
€ € R, we have 4 = n. 


DEFINITION (D5). [Rational Ranks]. In view of the first sentence of 
(E1), any torsion free divisible additive abelian group may clearly be re- 
garded as a Q-vector-space. The Q-vector-space dimension of the rational 
completion of a torsion free additive abelian group G is called the rational 
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rank of G and is denoted by r(G). Alternatively, r(G) may be character- 
ized as the cardinal of a maximal (= nonenlargeable) Z-linearly indepen- 
dent subset H of G, where independent means that for any finite number 
of distinct elements 71,...,%q in H and any integers n1,...,%q we have: 
me t+---tngtg =0>n1, =::- =n = 0. 


DEFINITION (D6). [Dedekind Cuts]. Instead of using Cauchy sequences 
to prove (E5), we can use Dedekind Cuts. So let G be a nonzero divisible 
archimedean ordered abelian group. A Dedekind cut of G is a pair (L,U) of 
nonempty subsets of G with U = G\ L such that for alll € L and u € U we 
have 1 < wand there is no u’ € U with U = {ue G: wu’ < u}. ForanyteG 
we get a Dedekind cut (Z,, Uz) with LD, = {16 G:l<thandUu,={ueG: 
t <u}. Let Dg be the set of all Dedekind cuts of G. It can be shown that 
G is complete = t + (Lt, Rz) gives a surjection G — Dg. Indeed, Dg may 
be defined to be the completion of G. At any rate, for proving (E5), given 
any h € G let 0(h) be the real number which corresponds to the Dedekind 
cut (L,U) of Q where L = {m/n € Q with m € Z and n € Ny: nh < mg} 
and U = Q\ L, and take ¢(h) = x0(h). 


DEFINITION (D7). [Approximate Roots]. In the Hint to (E6) we showed 
how to use n-adic expansions of positive integers to find successive approx- 
imations to the d-th root of a positive integer. Mixing a generalization of 
this with a generalization of the completing the square method of solv- 
ing quadratic equations leads us to the concept of approximate roots of 
polynomials. So consider a monic polynomial 


PoP@ysy"+ >) Aye 
1<i<N 
of degree N > 0 in Y with coefficients A; in a ring R. If N is a unit in R 
then we can generalize the completing the square idea to completing the 
N-th power by writing 


F(Y) = (4A ly” + S- ALY +. Ain * 
2<i<N 


with A‘ € R, i.e., by killing the coefficient of YN~!. [On page 58 of my 
Lectures on Algebra Volume I, inadvertently the two plus signs in the above 
display have been printed as minus signs and, three lines above that, N > 0 
has been printed as N > 0]. To generalize this further let D > 0 be an 
integer which divides NV. Instead of assuming N to be a unit in R, assume D 
to be a unit in R; note that in case of a field R this is equivalent to assuming 
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that the characteristic of R does not divide D and so characteristic zero is 
always ok. Now we look for a monic polynomial 


G=GY)=Y"P+ SO Byer 
1<i<N/D 


of degree N/D in Y with coefficients B; in R such that G? is as close to 
being equal to F as possible. As (E7) below shows, if we interpret this as 
requiring degy (F—G?) < N—(N/D) then a unique G exists, and we call it 
the approximate D-th root of F' (relative to Y) and denote it by Appp(£’) or 
Appp,y (F). Recall that for any m,n in N with n > 1, the n-adic expansion 
of m consists of writing m = pee mn’ where integers 0 < m; < n are the 
digits of the expansion. Likewise for any f,g in R{Y]| with g monic of positive 
Y-degree, the g-adic expansion of f consists of writing f = > ;., fig’ where 
fi € R[Y] with degy f; < degyg are the digits of the expansion. By (E8) 
below these expansions exist and are unique. Moreover, if f is monic of 
Y-degree N > 0 and the Y-degree of g is N/D € Ny where DEN, isa 
unit in R, then fp = 1 and f; = 0 for all i > D; we try completing the 
D-th power by putting ts(g) = Ty,y(g) = 9 + (fp-1/D) and calling it the 
f-Tschirnhausen of g (relative to Y). This references to the 1683 work of 
Tschirnhausen who was a friend of Leibnitz. By (E9) below, starting with 
any monic g of degree N/D and applying Ty to it N/D times will produce 
the approximate D-th root of f. 


EXERCISE (E7). Let F be a monic polynomial of degree N > 0 in Y over 
aring R. Let D > 0 be an integer such that D divides N and D is a unit in 
R. Show that there exists a unique monic polynomial G of degree N/D in Y 
over R such that degy(F—G”) < N—(N/D). Hint: With display as in (D7), 
the last condition gives the equations A; = DB; + P;(Bi,..., By—1) for 1 < 
i < N/D where the coefficient of YY ~* in G? equals DB;+P;(Bi,..., Bi-1) 
with P; a polynomial over Z; since D is a unit in R, these can be solved 
successively (in a unique manner). 


EXERCISE (E8). Given integers m > 0 and n > 1, show the unique exis- 
tence of the n-adic expansion of m. Given univariate polynomials f,g over a 
ring R with g monic of positive degree, show the unique existence of the g- 
adic expansion f = )>,., fig’ of f. Show that if f is monic of degree N > 0 
and the degree of g is N /D where D is a positive integer factor of N, then 
fo =1 and f; = 0 for alli > D, and moreover: Appp(f) = 9 © fp-1 = 0. 


EXERCISE (E9). Let f,g be univariate monic polynomials of positive de- 
grees N and N/D over a ring R where D is a positive integer which di- 
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vides N, and is a unit in R. Let rs(g) = g, and let f = Doycien fig’ and 
7 = > pax ola be the g-adic and g-adic expansions of f respectively. 
Show that if fp—1 404 fp_; then deg(fp_,) < deg(fp_1). From this 
deduce that Te! (g) = Appp(f). 


Epilogue 


MANGALACHARAN 

ATA VISHVATMAKE DEVE | YENE VAGYADNYE TOSHAVE 
TOSHONI MAJA DYAVE | PASAYDANA HE 
GANITAVIDYECHEE JAGRUTEE | KARONTYA SARVA JAGATEE 
PRADNYASURYE UJALATEE | SUKHAVAYA SAKALA JANA 


Here is a free Paraphrase of the above MANGALACHARAN = 
INVOCATION in my mother tongue MARATHI whose founding father 
DHYANESHVAR composed the first two lines around 1250 A.D. to which 
I added the last two lines. 


PARAPHRASE. May the Lord God of the Universe be pleased with my 
recounting of the story of algebra and geometry which are the essence of our 
beloved subject of mathematics. Being pleased may he shower his blessings 
upon us and make our endeavor pleasurable. 
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2.1. Introduction 


Let M be a smooth manifold and Eg a C™ principal G—bundle over M, 
where G is a Lie group. The Lie algebra of G will be denoted by g. Let V 
be a connection on Eq. Therefore, V is a g-valued smooth one—form on the 
total space of Eg satisfying the following two conditions: 


e the map TEg — g defined by V intertwines the action of G on 
Eg and the adjoint action of G on g, and 
e the restriction of V to any fiber of Eg is the Maurer—Cartan form. 


The connection V defines parallel transport on Eg. In other words, 
given a smooth curve 


y: [0,1] — M 
and a point z in the fiber (Eq).(9) of Eg over the point (0), there is a 
unique smooth map 

7, : [0,1] — Ec (2.1) 
that satisfies the following three conditions: 


e poy: = x, where p : Eg — M is the natural projection, 
e ¥- is horizontal with respect to V, which means that the image of 
the differential dy, lies in the kernel of the form V, and 


e 4,(0) = z. 
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Any ¥z as in Eq. (2.1) satisfying the first two of the above three condi- 
tions is called a horizontal lift of y. 

Fix a point a9 € M, and also fix a point z € (EcG)x,- Consider all 
curves y as above with y(0) = xo, and consider their horizontal lifts ¥ with 
7(0) = zo. Let Z denote the closure of the subset of Eg consisting of all 
points of the form ¥(1), with 7 of this type. There is a closed subgroup H 
of G such that a point z € (Ea). lies in Z if and only if z = zog for some 
g € H. More precisely, 


Ey = ZC He (2.2) 


is a smooth reduction of structure group of Eg to H, and H is a closed 
subgroup of G. 

Note that if we replace the base point zo by any other point of Z, then 
both H and Z remain unchanged. If we replace z) by zogo, where gop € G 
is a fixed point, then Z gets replaced by Zgo, and H gets replaced by 
the group go 'Hgo. Hence the conjugacy class of the subgroup H C G is 
independent of 29 and zo (it depends only on V). 

Let Ad(Eqg) = Eq x° G be the adjoint bundle. So Ad(Eg) is the fiber 
bundle over M associated to Eg for the adjoint action of G on itself. Hence 
the fibers of Ad( Eq) are groups isomorphic to G. More precisely, each fiber 
of Ad(Eca) is identified with G uniquely up to an inner automorphism of 
G. Consider 


Ad(Ex) C Ad(Ec), (2.3) 


where Ey is defined in Eq. (2.2). Since H gets replaced by gp ‘Ho if zo is 
replaced by zogo, the subgroup 


Ad(Ex)z, C Ad(Ee)ao 


is independent of the choice of zo; it depends only on V and 29. The sub- 
bundle Ad(Ey) of Ad(£g@) in Eq. (2.3) depends only on V. 

The subbundle Ad(£7,) in Eq. (2.3) is called the holonomy bundle, and 
the subgroup Ad(Fy),, of Ad(EG)zx, is called the holonomy group for V. 
Note that the holonomy group depends on 20. 

The conjugacy class of subgroups of G' defined by Ad(F 7), (recall that 
Ad(Eg@)zx, is identified with G up to an inner automorphism) is independent 
of the choice of xo. 

Now let X be a smooth projective variety defined over a field k, and 
let G be a linear algebraic group defined over k. Let Eq be a principal 
G-bundle over X. Under some semistability assumption on Eg, and also 
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an assumption that X admits a k-rational point, a canonical holonomy 
group of Eg exists, and also a holonomy bundle of Eg exists. Note that no 
reference to any connection on Eg is made. 

The constructions of the above mentioned holonomy group and the 
holonomy bundle are based on a work of Nori. Our aim here is to describe 
these constructions. 

In [5] and [6], the term monodromy is used instead of holonomy. We will 
stick to the terminology of [5] and [6]. 

In [1] an interesting construction of holonomy group (= monodromy) 
and holonomy bundle of polystable vector bundles over complex projective 
varieties is given (see also [2], [3]). An application of holonomy is given 
is [4]. 

We will mainly describe results from [6]. In the last section, we will very 
briefly describe some work in progress. 


2.2. Tannakian Category 


In this section we will briefly recall some results on Tannakian category 
that we need. This section is reproduced from [5]. 

Let k be any field and H an affine algebraic group scheme defined over 
k. We will denote by H—mod the category of finite dimensional rational left 
representations of H. The tensor product operation of two H—modules will 
be denoted by ®. We denote by k—mod the category of finite dimensional 
vector spaces over k, and we denote by ® the usual tensor product operation 
on k—mod. Let 


T : H-mod — k-mod 


be the forgetful functor. Let O be a trivial one dimensional representation 
of H. The quadruple (H—mod, ®,T, ©) determines H in the following sense 
(see [14, Theorem 1], and [11, Section 2.1, Theorem 1.1]). 


Theorem 2.1. ({11], [14]) Let C be a category with a distinguished object 
O equipped with an operation 


®:CxC—C 
and T : C —+ k-mod a functor satisfying the following eight conditions: 


(C1) C is an abelian category with direct sums; 
(C2) isomorphism classes of objects of C form a set; 
(C3) T is an additive, faithful and exact functor; 
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(C4) & is k-linear in each variable, and To ® = ®0(T xT); 

(C5) & is associative preserving T; 

(C6) & is commutative preserving T; 

(C7) the object O of C is equipped with an isomorphism ¢ : k —>+ T(O) 
such that O is an identity object of ® preserving T; 

(C8) for every object L of C such that T(L) is one dimensional, there is 
an object L~! of C such that LQL~! is isomorphic to O. 


Then there exists a unique affine algebraic group scheme H defined 
over k- such that the quadruple (C, ®, T, O) is identified with the quadru- 
ple (H-mod, ®,T, ©). 


Any quadruple (C,®,T,O) as in Theorem 2.1 satisfying all the condi- 
tions in Theorem 2.1 will be called a neutral Tannakian category (see [11, 
p. 76]). 

Let H, and Hz be two affine algebraic group schemes. An algebraic 
group homomorphism H, — > Hg is defined by a morphism of neutral 
Tannakian categories 


C, : Ha-mod — H\-mod (2.4) 


compatible with @, T and O. Such a morphism C7 defines a closed embed- 
ding of H; in H2 if and only if every object of H,;—mod is isomorphic to a 
subquotient (in H,—mod) of the image of an object of H2-mod under C7. 
This is proved in [7, p. 139, Proposition 2.21(b)]. 

Next we recall a description of principal bundles over a scheme S defined 
over k using the language of categories and functors in a similar spirit. Let 
H be an affine algebraic group scheme defined over k and Ey — Sa 
principal H-bundle over S. Then for every object 


p: H — GL(V) 


in H—mod we can construct the associated vector bundle E, := Ex ye _y 
over S. Here Ey x#¥V = (Ey xV)/H with the action of any h € H sending 
any (z,v) € Ey xV to (zh, p(h~+)v) (a point of a group scheme H over k 
means a R-valued point for some k-algebra R). This defines a functor Fz 
from the category H—mod to the category of vector bundles Vect(S') on S. 
Nori proved, in [11, Section 2.2], that this functor determines the principal 
H-bundle in the following sense: 


Theorem 2.2. ({11]) Let F : H-mod — Vect(S) be a functor satisfying 
the following: 
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(F1) F is a k-additive exact functor; 

(F2) Fo@=@0(F xf); 

(F3), the functor F preserves commutativity, or in other words, if c is the 
canonical isomorphism of V@W with WV in H-mod, then F(c) 
is the canonical isomorphism of the corresponding vector bundles; 

(F3)2 the functor F preserves associativity, which means that if a is the 
canonical isomorphism of U@(V@W) with (U@V)@W in H-mod, 
then F(a) is the canonical isomorphism of the corresponding vector 
bundles; 

(F3)3 the vector bundle F(O) is the trivial line bundle Os on S; 

(F4) for any V € H-mod of dimension n, the vector bundle F(V) is of 
rank n. 


Then there exists a unique principal H-bundle E —> S' such that F is 
identified with Fr. 


Let 
p> My —» Ho (2.5) 
be a group homomorphism defined by the functor 
Cy, : Ha-mod — H,-—mod 


as in Eq. (2.4). Suppose E; —— S is a principal H;-bundle defined by a 
functor F, : Hi-mod — Vect(S). We define 


F2 : H2—mod — Vect(S) 


by Fo := F,0Cy,. Since both F; and C; are additive, exact, preserving 
commutativity and associativity it follows that F2 satisfies all the conditions 
(F1), (F2), (F3); and (F'3)2 in Theorem 2.2. For the remaining conditions 
in Theorem 2.2, recall that C7 takes O to O, and it preserves the rank of the 
modules. In view of this, the functor F2 satisfies both (F'3)3 and (F'4) as the 
functor F, does so. Consequently, F2 defines a principal H2-bundle over S; 
the principal H2-bundle constructed this way will be denoted by E2. This 
principal H2-bundle F2 is identified with the principal H2-bundle EF x“! Ho 
obtained by extending the structure group of FE; using the homomorphism 
p in Eq. (2.5). We recall that E, x"! Hy = (FE, x H2)/H;1, where the action 
of any h € Hy sends any (z,h2) € Ey x He to (zh, p(h~")ha). 


Definition 2.1. Let E2 be a principal H2-bundle over S given by a functor 
Fz as in Theorem 2.2. A reduction of structure group to a closed subgroup 
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scheme H, C Hg is a principal H,-bundle FE; given by a functor Fy, that 
satisfies the identity 


Fo = Frio Cr; 
where C7 is the functor as in Eq. (2.4) defining the subgroup Hi @ Ho. 


Giving a reduction of structure group of a principal H2-bundle E2 over S$ 
to a closed subgroup scheme H, C Hg is equivalent to giving a subscheme 


EE, C Eo 


such that the action of H; on Eg preserves E, making it a principal Hj- 
bundle over S. 


2.3. A Tannakian Category for a Pointed Curve 


Fix a field k. Let X be a geometrically irreducible smooth projective curve 
defined over this field k. 

The notion of a semistable vector bundle was introduced by Mumford; 
we will recall the definition. 

A vector bundle FE over X is called semistable if for every subbundle 
V Cc E of positive rank, the inequality 


degree(V ) se degree(E) 
rank(V) ~  rank(£) 


(2.6) 


holds. If 
degree(V ) z degree(F) 
rank(V ) rank(£) 
for all subbundles V with 0 < rank(V) < rank(£), then EF is called stable. 
The rational number eer is called the slope of E, and it is customary 
to denote it by u(E). 


Let @ be a field extension of k. If a vector bundle F over X admits a 
subbundle V with u(V) > p(£), then for the subbundle V ®), @ of the 
vector bundle FE ®), ¢ over X Xx, @, the inequality u(V ®&, 4) > u(L &, 2) 
holds. In other words, if F is not semistable, then EF ®), @ is not semistable. 
The following proposition shows that the converse is also valid. 


(2.7) 


Proposition 2.1. Let ¢ be a field extension of k. A vector bundle E over 
X is semistable if and only if the vector bundle E®, & over X x, & is 
semistable. 
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The above proposition is proved in [9] under the assumption that k is 
infinite (see [9, p. 97, Proposition 3]), and it is proved in [8] under the 
assumption that k is perfect (see [8, p. 222]). Hence the proposition holds 
for all fields. 

If the characteristic of k is p, with p > 0, then we have the Frobenius 
map 


Fy :k —k 
defined by A +— A”. Consider the Cartesian diagram 
Xx X% > X 


| | (2.8) 


Spec(k) iA Spec(k) 


where and 7 is the relative Frobenius map (see [13, p. 118]). The composi- 
tion yor will be denoted by Fx. If the characteristic of k is zero, then F'x 
will denote the identity map of X. 

A semistable vector bundle W over X is called strongly semistable if the 
iterated pull back 


n—times 


a 
(Fx 0---0 Fx)*W 


is semistable for all n > 1. 

So if the characteristic of k is zero, then any semistable vector bundle 
is strongly semistable. We note that strongly semistable vector bundles are 
usually defined under the assumption that the base field is perfect. In view 
of Proposition 2.1, the above definition is compatible with it. 

Let Sx denote the category of all strongly semistable vector bundles on 
X. Let Cx denote the space of all maps 


f : Q — Obj(Sx) (2.9) 
satisfying the following two conditions: 


e f(A) = 0 for all but finitely many rational numbers, and 
e For any A € Q with f(A) ¥ 0, 


W(f(A)) = A. 


We will define direct sum and tensor product, as well as dual, of objects 


of Cx. 
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If E,F € Sx (see Eq. (2.9)) with u(F) = u(F), then the direct sum 
E@ F is also strongly semistable with w(E @ F) = p(£). Also, the dual 
vector bundle E* is strongly semistable with w(£*) = —p(£). 

For any f,g € Cx, define f Bg € Cx by 


A> F(A) ® 9g). 
Define the dual f* of f by 
At f(-A)*. 


For V,W € Sx of positive ranks, the vector bundle V ® W is also 
strongly semistable ([12, p. 288, Theorem 3.23]). We also note that 


WV @®W) = WV) + u(W). (2.10) 
For any f,g € Cx, define 
(f@9 A) = Ore) —2z). (2.11) 
zEQ 
For any two objects f,g € Cx, an element of 
Hom(f ,9) = GB A°(X, Hom( f(y) ,9(7))) (2.12) 
yEQ 


will be called a homomorphism from f to g. Any h € Hom(f ,g) will also 
be considered as a map 


h: Q— QA, Hom(F(1) ,9(7))) 
EQ 
such that h(y) € H°(X, Hom(f(y),9(y))) for all 7. 
A homomorphism h from f to g will be called an isomorphism if 


h(y) : f(y) — 9(y) 


is an isomorphism for all y € Q. 

Take any object f in the category Cx. A sub-object of f is an object f’ 
in Cx such that for each A € Q, the vector bundle f’(A) is a subbundle of 
the vector bundle f(A). If f’ is a sub—object of f, then the object of Cx that 
sends any A € Q to the quotient vector bundle f(A)/f’(A) will be called 
a quotient—object of f. For any object f of the category Cx, a sub-quotient 
of f is a sub—object of some quotient—object of f. 

Let p : V — W be a homomorphism between strongly semistable 
vector bundles V and W over X with 


WV) = WW). (2.13) 
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Then either y is injective, or kernel(y) is a strongly semistable vector bun- 
dle over X with pu(kernel(y)) = p(V). Similarly, either vy is surjective or 
cokernel(y) := W/(V) is a strongly semistable vector bundle over X with 
p(cokernel(¢)) = p(W). 

Take any f,f’ € Cx and any h € Hom(f, f’) (see Eq. (2.12)). Con- 
sider the function Q —> Sx that sends any A € Q to the kernel of the 
homomorphism 


h(A) : fA) — f'Q). 
This function defines an object of Cx, which we will call the kernel of h, 
and it will be denoted by kernel(h). So kernel(h) is a sub-object of f. 


Similarly, the object of Cx defined by function Q — Sx that sends 

any A € Q to the cokernel of the homomorphism 
h(A) : f(A) — FA) 

will be called the cokernel of h, and it will be denoted by cokernel(h). So 
cokernel(h) is a quotient—object of f’. 

We note that Cx is an abelian category. 

We will henceforth assume that the curve X admits a k—rational point. 
Fix a k-rational point x of X. 

For any vector bundle V over X, the fiber of V over x will be denoted 
by Ve. 

The category of finite dimensional vector spaces over the field k will be 
denoted by k—mod. 

We have a functor 


w:Cx — k-mod (2.14) 
defined by 


zEQ 

Let Ox be the trivial line bundle over X defined by the structure sheaf 
of X. The object in Cx that sends any A € Q\ {0} to the vector bundle of 
rank zero and sends 0 to Ox will also be denoted by Ox. 

The triple (Cx ,Ox ,w), where w is defined in Eq. (2.14) and Ox € Cx 
is the object defined above, together form a neutral Tannakian category 
over k; see [7], [10] for neutral Tannakian category. Any neutral Tannakian 
category over k gives an affine group scheme defined over k (see Theorem 
2.1). Hence the triple (Cx ,Ox ,w) produce an affine group scheme defined 
over k. 
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Let Gx denote the group scheme defined over k given by the neutral 
Tannakian category (Cx ,Ox ,w). 

We will now show that there is a tautological principal Gx—bundle over 
des 

Let Vect(X) denote the category of vector bundles over X. We have a 
functor 


Fx : Cx — Vect(X) , (2.15) 
defined by 


fr> @ f(2. 
A=—0o 
Using [11, Lemma 2.3, Proposition 2.3], the functor Fx in Eq. (2.15) defines 
a principal Gx—bundle over X, where Gx is the affine group scheme defined 
above (see Theorem 2.2). 
Let Eg, denote the tautological Gx—bundle over X constructed above. 


2.4. Monodromy of a Strongly Semistable Principal Bundles 


As before, k is any field, X is a geometrically irreducible smooth projective 
curve defined over k and x is a k-rational point of X. 

Let G be a linear algebraic group defined over the field k. Let Z(G) 
denote the connected component, containing the identity element, of the 
reduced center of G. 

We assume that G satisfies the following condition: there is no nontrivial 
character of G which is trivial on Z(G). 

Let 


BiG) € ZUG) (2.16) 


be the (unique) maximal split torus contained in Z{(G). Let Zo(G)* de- 
note the group of all characters of Z(G). The subgroup Z(G) gives a 
decomposition of any G—module, which we will describe now. 

Let V be a finite dimensional left G-module. Let be the decomposition 


V= @ Ky, (2.17) 
xEZo(G)* 


into isotypical components of the Zo(G)—module V. So for any character x 
of Z(G), 


Vee ¥ 
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is the subspace on which Z(G) acts as scalar multiplications through the 
character x. It is easy to see that each Vy is left invariant by the action of 
GonV. 

See [6] for the proof of the following lemma. 


Lemma 2.1. Take any character x € Z(G)*. Let V and W be two 
nonzero finite dimensional left G-modules such that Zo(G) acts on both 
V and W as scalar multiplications through the character x. Let Eg be a 
principal G—bundle over X. Let Ey (respectively, Ew) be the vector bun- 
dle over X associated to the principal G—bundle Eq for the G—module V 
(respectively, W ). Then 


u(y) = w(Ew). 
The following is a corollary of [7, p. 139, Proposition 2.21]. 


Corollary 2.1. All characters of Zo(G) arise from the indecomposable rep- 
resentations of G. In other words, for any character x of Zo(G), there is 
some nonzero finite dimensional indecomposable left G-module V such that 
Zo(G) acts on V as scalar multiplications through the character x. 


If EF and F are two vector bundles over X, then p(E® F) = u(E) + 
u(F’). Therefore, Lemma 2.1 and Corollary 2.1 combine together to give 
the following corollary: 


Corollary 2.2. Fix any principal G-bundle Eq over X. Then there is a 
homomorphism to the additive group 


OEG 7 Zo(G)* —? Q 


that sends any character y to u(Ey), where V is a finite dimensional 
nonzero left G-module on which Z(G) acts as scalar multiplications 
through the character x, and Ey is the vector bundle over X associated 
to the principal G—bundle Eq for the G—module V. 


Definition 2.2. Let G be any affine group scheme defined over k. A princi- 
pal G-bundle Eg over a geometrically irreducible smooth projective curve 
X will be called strongly semistable if for any indecomposable finite dimen- 
sional left G-module V € G—mod, the vector bundle over X associated to 
Eg for V is strongly semistable. 


See [13], [12] for the definition of a (strongly) semistable principal bun- 
dle with a reductive group as the structure group. We will show that the 
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above definition coincides with the usual definition a (strongly) semistable 
principal G—bundle when G is reductive. For that we will need the follow- 
ing theorem. We note that Definition 2.2 is justified by the following lemma 
which is proved in [6]. 


Lemma 2.2. Let H be a reductive linear algebraic group and X a ge- 
ometrically irreducible smooth projective curve defined over k. A principal 
H-bundle Ey over X is strongly semistable if and only if for every indecom- 
posable H—module V, the vector bundle Ey = Exy(V) over X associated 
to the principal H—-bundle Ey for V is strongly semistable. 


Let Eq be a strongly semistable principal G—bundle over X. To each 
G-module we will associate an object of the neutral Tannakian category 
Cx that we constructed in Section 2.3. 

Let V be a finite dimensional left G-module. We noted earlier that each 
V, in Eq. (2.17) is a G-module. Let Ey, be the vector bundle over X 
associated to the principal G—bundle Eg for the above G—module V,. 

The following lemma is proved in [6]. 


Lemma 2.3. The vector bundle Ev, is strongly semistable, and if V. # 0, 
then 


MEv,) = be6(X); 


where Oz, is the homomorphism constructed in Corollary 2.2. 
Lemma 2.3 has the following corollary: 


Corollary 2.3. For any X € Q and any V € G-mod, the direct sum 
EX(V) := cS Ey, 
{x€Zo(G)*|dzq (x)=A} 
is either zero, or it is a strongly semistable vector bundle with 
w(EQ(V)) = 2. 
As before, G-mod denote the category of finite dimensional G—modules. 


As before, by Vect(X) we will denote the category of vector bundles over 
X. Consider the function 


frav :Q— Vect (X ) 
defined by 
fo.v(A) := EA(V), (2.18) 
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where V € G~mod, and E4(V) is defined in Corollary 2.3. From Corollary 
2.3 it follows immediately that this function fz,,v is an object of the cat- 
egory Cx constructed in Section 2.3. Therefore, to each object of G-mod 
we have associated an object of Cx. 

Let Cz, denote the subcategory of Cx defined by all objects f of Cx such 
that there exists some V € G~—mod with the property that f is isomorphic 
to a sub-quotient of fzg,v, where fro,v is the object of Cx constructed 
from V in Eq. (2.18). The morphisms remain unchanged. In other words, 
for any two objects f and f’ in Cg,, the morphisms from f to f’ are the 
morphisms from f to f’ considered as objects of Cx. 

It is straight-forward to check that Cz, is a neutral Tannakian subcat- 
egory of Cx. Therefore, the neutral Tannakian category Cz, gives an affine 
group scheme defined over k (see Theorem 2.1). 


Definition 2.3. The affine group scheme defined over k given by the neu- 
tral Tannakian category Cz, will be called the monodromy group scheme 
of Eg. The monodromy group scheme of Eg will be denoted by M. 


Since Cg, is a Tannakian subcategory of Cx, the monodromy group 
scheme M is a quotient of the group scheme Gx constructed in Section 2.3 
(see [7, Proposition 2.21]). Let 


dpe Gx — M (2.19) 


be the quotient map. 
Just as we have the tautological Gx—bundle Eg, , there is a tautological 
principal M—bundle over X. 


Definition 2.4. Let Ey, denote the tautological principal /—bundle over 
X. This principal M—bundle Eyy will be called the monodromy bundle for 
Eg. 


The principal M—bundle Ey, is evidently the one obtained by extending 
the structure group of the principal Gx—bundle Eg, using the homomor- 
phism ¢z, in Eq. (2.19). 

We will next show that there is a tautological embedding of the mon- 
odromy group scheme M into the fiber, over the fixed k—rational point 
x € X, of the adjoint bundle for Eg. 

Let Ad(E@) be the adjoint bundle for the principal G—bundle Eg over 
X. Let Ad(Eg)z be the fiber of Ad(E@) over the fixed k-rational point x 
of X. 
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If w is the fiber functor for the principal G-bundle Eg over X, then 
the group Ad(Eg),z defined over k represents the functor Aut®(w). Using 
Theorem 2.11 in [7, p. 130], we get a natural homomorphism from the group 
scheme Gx (constructed in Section 2.3) to Ad(E@).«. Let 


®(Eg) : Gx ah Ad(E@)z (2.20) 


be this natural homomorphism. 

It is easy to see that the image of the homomorphism ®(Eg) in 
Eq. (2.20) coincides with the monodromy group scheme M in Definition 
2.35 

Therefore, we have the following proposition: 


Proposition 2.2. The monodromy group scheme M for Eg (introduced in 
Definition 2.3) is identified with the image of the homomorphism ®(Eq@) in 
Eq. (2.20). In other words, the kernel of the homomorphism ®(Eq@) coin- 
cides with the kernel of the homomorphism og, in Eq. (2.19). 

There is a natural inclusion M — Ad(Eq)z obtained from the fact that 
the quotients of Gx for the two homomorphisms ®(Eq) and dg, coincide. 


We will now investigate the behavior of the monodromy group and the 
monodromy bundle under the extensions of structure group. 
Let 


p28 + (2.21) 


be an algebraic homomorphism between linear algebraic groups defined 
over k. Let Z(G) denote the (unique) maximal split torus contained in 
the reduced center of G,. We assume the following: 


e The group G; does not admit any nontrivial character which is 
trivial on Z(G). 
e The homomorphism p in Eq. (2.21) satisfies the condition 


P(Zo(G)) C Zo(Gr). (2.22) 
The following lemma in proved in [6]. 


Lemma 2.4. Let Eq be a strongly semistable principal G-bundle over 
X. Then the principal G,-bundle Eg, := Ea(G1), obtained by extending 
the structure group of Eg using p (defined in Eq. (2.21)) is also strongly 
semistable. 


Let Eg be a strongly semistable principal G—bundle over X. Hence 
by Lemma 2.4, the principal G—bundle Eg,, obtained by extending the 
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structure group of the principal G-bundle Eg using the homomorphism p, 
is also strongly semistable. The homomorphism p in Eq. (2.21) induces a 
homomorphism of group schemes 


p: Ad(Eg) — Ad(Ec,) (2.23) 


over X. 
The following lemma is proved in [6]. 


Lemma 2.5. The monodromy group scheme M, C Ad(Ecg,)« for Ec, 
is the image p(x)(M), where p(x) is the homomorphism in Eq. (2.23) re- 
stricted to the k-rational point x of X, and M Cc Ad(E@)z is the mon- 
odromy group scheme of Eg (see Proposition 2.2). Furthermore, the mon- 
odromy bundle Ey, for Eg, is the extension of structure group of the mon- 
odromy bundle Ex, for Eq by the homomorphism M — + My, obtained by 
restricting p(x). 


2.5. More on Monodromy 


For a subgroup scheme H C G, let (H()Zo(G))rea denote the reduced 
intersection; let (H () Zo(G))o denote the (unique) maximal split torus con- 
tained in the abelian group (H () Z0(G))rea- 

As before, let Eg be a principal G—bundle Eg over the curve X. 


Definition 2.5. A reduction of structure group 
Ey C Ee (2.24) 


of Eg to a subgroup scheme H C G will be called balanced if for every 
character y of H trivial on (H()Zo(G))o C H (see the above definition), 
we have 


degree(Fin(x)) = 0, 


where F7(x) is the line bundle over X associated to the principal H—bundle 
Ey for the character yx. 


Since any character of (H () Zo(G))rea/(H (1) Z0(G))o is of finite order, 
if a character y of H is trivial on (H() Zo(G))o, then there is a positive 
integer n such that the character y" of H is trivial on (H () Zo(G))rea. 
Therefore, a reduction of structure group Ey C Eg as in Definition 2.5 is 
balanced if and only if for every character y of H trivial on (H (} Zo(G))rea 
we have 


degree(Exn(x)) = 0. 
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Since the quotient (H()2Z0(G))/(H() Z0(G))rea_ is a finite group 
scheme, if a character x of H is trivial on (H (| Zo(G))rea, then there is a 
positive integer n such that the character y” of H is trivial on H (] Z(G). 
Therefore, a reduction Ey C Eg as in Definition 2.5 is balanced if and 
only if for every character x of H trivial on H (Z(G) we have 


degree(Exn(x)) = 0. 
The following proposition is proved in [6]. 


Proposition 2.3. Let Eg be a strongly semistable principal G-bundle over 
X and Ey C Eg a balanced reduction of structure group of Eg to a sub- 
group scheme H C G. Then the principal H—bundle Ey over X is strongly 
semistable. 


Let Eg be a strongly semistable principal G—bundle over X. In Propo- 
sition 2.2 we saw that the monodromy group scheme M (constructed in 
Definition 2.3) is canonically embedded in Ad(E£g@),. For national conve- 
nience, we will denote by G the group Ad(Eq)z defined over k. Let Ea be 
the principal G-bundle over X obtained by extending the structure group 
of the monodromy bundle Ey, (see Definition 2.4) using the inclusion of 
in G. Therefore, 


Emu Cc ke (2.25) 


is a reduction of structure group of Ee to M. 
Let Z(G) denote the unique maximal split torus contained in the re- 
duced center of G. 


The following theorem is proved in [6]. 


Theorem 2.3. Assume that the group G := Ad(Eq)z does not admit 
any nontrivial character which is trivial on Zo(G). Then the reduction of 
structure group in Eq. (2.25) is a balanced reduction of structure group of 


Ee to M. In particular, the principal M—bundle Ey is strongly semistable. 


Assume that the fiber of the principal G-bundle Eg, over the k—-rational 
point x of X, admits a rational point. If we fix a rational point in the fiber 
of Eg over x, then G gets identified with G, and the principal bundle Ee 
gets identified with Eg. 

If Ey C Eg is a reduction of structure group, to a subgroup scheme 
H C G, of a principal G-bundle Eq over X, then the adjoint bundle 
Ad(Ey#) is a subgroup scheme of the group scheme Ad(E@) over X. 
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The following theorem is proved in [6]. 


Theorem 2.4. Let Eg be a strongly semistable principal G-bundle over a 
geometrically irreducible smooth projective curve X defined over k, where G 
is a linear algebraic group defined over k with the property that G does not 
admit any nontrivial character which is trivial on Zo(G). Fix a k-rational 
point « of X. Let H C G be a subgroup scheme and Ey C Eg a balanced 
reduction of structure group of Eg to H. Then the image in Ad(Eq). of the 
monodromy group scheme M (image by the homomorphism in Proposition 
2.2) is contained in the subgroup scheme Ad(Ey), C Ad(E@)zx. 


More properties of the monodromy group and the monodromy reduction 
can be found in [5]. 


2.6. Bundles on Higher Dimensional Varieties 


In this section we will assume k to be an algebraically closed field, but we do 
not put any assumptions on the characteristic of k. Let X be an irreducible 
smooth projective variety equipped with a very ample line bundle €. 

The degree of a torsionfree coherent sheaf F on X is defined to be the 
degree of F' restricted to the general complete intersection curve obtained 
by intersecting hyperplanes from the complete linear system |é]. 

A torsionfree coherent sheaf F over X is called semistable (respectively, 
stable) if the inequality Eq. (2.6) (respectively, Eq. (2.7)) holds for all 
coherent subsheaves V C E with 0 < rank(V) < rank(£). A semistable 
sheaf is called polystable if it is a direct sum of stable sheaves. 

As before, a semistable vector bundle EF is called strongly semistable if 
its iterated pullbacks by the self-map F’x of X are all semistable. We recall 
Fx is the Frobenius map of X when the characteristic of k is positive, and 
it is the identity map of X when the characteristic of k is zero. 

Let E be astrongly semistable vector bundle over X. There is a filtration 
of coherent subsheaves 


= Yew c=. €¢ Ves chy & 


such that each successive quotient V;/V;-1, 7 € [1,n], is polystable, and 
w(Vi/Vi-1) = pw(V). More precisely, V;/Vi-1 is the maximal polystable 
subsheaf of E'/V;_1, which is also called the socle of E/Vj_1. 

Let 


Up CX (2.26) 
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be the maximal Zariski open dense subset such that each V;/V;-1, i € 
[1 ,n], is locally free. It can be shown that E’ possesses a monodromy group 
as well as a monodromy bundle over the open subset Up. 

Compared to the case of curves, the construction of the monodromy 
group and the monodromy bundle for higher dimensional X is technically 
more complicated. 

Let G be a connected reductive linear algebraic group defined over k. Let 
Eg be a strongly semistable principal G—bundle over X. Consequently, the 
adjoint vector bundle ad(£a) is strongly semistable ([12, p. 288, Theorem 
3.29] ) 

Fix a point  € Usa(eg), Where Usa(z,) is constructed as in Eq. (2.26). 
There is a canonical monodromy group H Cc Ad(E@)z as well as a mon- 
odromy reduction over the open subset U,q(eg)- 
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A permutation group G (acting on a set 2, usually infinite) is said to 
be oligomorphic if G has only finitely many orbits on 2” (the set of n- 
tuples of elements of Q). Such groups have traditionally been linked with 
model theory and combinatorial enumeration; more recently their group- 
theoretic properties have been studied, and links with graded algebras, 
Ramsey theory, topological dynamics, and other areas have emerged. 

This paper is a short summary of the subject, concentrating on 
the enumerative and algebraic aspects but with an account of group- 
theoretic properties. The first section gives an introduction to permuta- 
tion groups and to some of the more specific topics we require, and the 
second describes the links to model theory and enumeration. We give 
a spread of examples, describe results on the growth rate of the count- 
ing functions, discuss a graded algebra associated with an oligomorphic 
group, and finally discuss group-theoretic properties such as simplicity, 
the small index property, and “almost-freeness” . 


3.1. Introduction 


Despite the history and importance of group theory, we have very little 
idea what an arbitrary group looks like. We have made important strides 
in understanding finite groups, by determining the finite simple groups; 
but we can only study general groups under some very strong condition, 
usually a finiteness condition. We have theories of finitely generated groups, 
locally finite groups, residually finite groups, groups of finite cohomological 
dimension, linear groups, profinite groups, and so forth, but no theory of 


general groups. 
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Oligomorphic groups satisfy a rather different kind of finiteness condi- 
tion; paradoxically, one which makes them “large” rather than “small”. A 
permutation group G (a subgroup of the symmetric group on a set {) is 
said to be oligomorphic if G has only finitely many orbits on 2” for every 
natural number n. (An element of G acts componentwise on the set 2” of 
all n-tuples of points of 2.) 

Thus, by definition, an oligomorphic group G gives us a sequence of 
natural numbers, the numbers of orbits on n-tuples for n = 0,1,2,.... 
Not surprisingly, the theory is connected with counting problems in vari- 
ous parts of mathematics (combinatorics, model theory, graded algebras). 
Curiously, there is rather less we can say about the groups themselves. If 
G is oligomorphic and H is a proper subgroup having the same orbits on 
n-tuples as G for all n, then counting cannot distinguish between G and 
H, even though they may be very different as groups (for example, G may 
be simple while H is a free group). 

In the remainder of this section, we introduce some basics of permuta- 
tion group theory and of the counting functions associated with oligomor- 
phic groups. For further information about permutation groups, see [7, 14]. 
Note also that there are many connections between parts of the theory 
of oligomorphic groups and that of (combinatorial) species, as developed 
by [21]. 


3.1.1. Permutation groups 


This section is a brief introduction to permutation groups. For more details, 
see [7]. 

The symmetric group Sym(Q) on a set 2 is the group of all permutations 
of Q. If O is finite, say Q = {1,2,...,n}, we write the symmetric group as 
S,,. We write permutations on the right, so that the image of a under g is 
written ag. 

An orbit of G is an equivalence class of the relation a ~ @ if ag = G for 
some g € G; in other words, a set of the form {ag : g € G}. We say that G 
is transitive if it has only one orbit. In a sense, any permutation group can 
be “resolved” into transitive groups. 

There is a partial converse. If G; is a transitive permutation group on 
Q; for i € I, where J is some index set, the cartesian product [J,-; Gi is 
the set of functions f : J — U,-, Gi satisfying f(7) € G; for all i € J. It 
has two natural actions: 


e The intransitive action on the disjoint union of the sets Q;: if a € Q;, 
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then af = af (i). If each group G; is transitive, then the sets Q; are 
the orbits of the cartesian product. 

e There is also a product action, componentwise on the cartesian prod- 
uct of the sets Q;. 


If I is finite, we speak of the direct product, and write it as (for example) 
Gy 4 eR Ge, Its Bh 

Thus, if G; = S, and Gy = S»,, then G; x G2 has an intransitive action 
on m+n points, and a product action on mn points (which can be regarded 
as a rectangular grid with G; permuting the rows and G2 the columns). 

Now let G be transitive on 2. A congruence is an equivalence relation 
on 2 which is G-invariant. There are two “trivial” congruences on 2: the 
relation of equality, and the “universal” relation with a single equivalence 
class. We say that G is primitive if there are no other congruences. For 
example, the symmetric group is primitive. 

Important examples of imprimitive groups are the wreath products, de- 
fined as follows. Let H be a permutation group on I, and K a permutation 
group on A. Let 2 =T x A, regarded as a union of copies of I’ indexed by 
A: thus T's = {(7,6) : y € T}, for each 6 € A. Take a copy H; of H for each 
6 € A, where H; acts on I's. Then the cartesian product B = [[;., Hs 
acts on Q) (in its intransitive action). Moreover, the group K acts on 2 by 
permuting the second components (i.e. by permuting the “fibres” T's. The 
wreath product G = HWrK is the group generated by B and K; we call B 
the base group (it is a normal subgroup) and K the top group of the wreath 
product. 

We note that there are different notations and conventions in other areas 
of mathematics. For example, in experimental design in statistics (cf. [2]), 
direct product (product action) is called crossing, and wreath product is 
called nesting, but nesting is written with the top structure before the 
bottom one, e.g. A/T in our case. 

If || > 1 and |A| > 1, the wreath product is imprimitive: the relation 
(y,6) = (9’,6’) if 6 = 0’ is a congruence. For this reason it is called the 
imprimitive action. Any imprimitive permutation group can be embedded 
in a wreath product in a natural way. 

There is another action of the wreath product, the power action, on 
the set I of functions from A to I. It bears a similar relation to the 
imprimitive action as the product action does to the intransitive action for 
the cartesian product. If we regard the domain of the imprimitive action 
as a fibred space, with fibres ‘5 isomorphic to [ indexed by A, then the 
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domain for the power action is the set of global sections (subsets containing 
one point from each fibre). 


3.1.2. Oligomorphic permutation groups 


For a natural number n, a permutation group is n-transitive if it acts tran- 
sitively on the set of n-tuples of distinct elements of Q, and n-set-transitive 
if it acts transitively on the set of n-element subsets of 2. We say a per- 
mutation group is highly transitive if it is n-transitive for all n, and highly 
set-transitive if it is n-set-transitive for all n. 

Oligomorphic permutation groups generalise these classes. Thus, we let 
F,,(G) denote the number of orbits of G on the set of n-tuples of distinct 
elements, and f,(G) the number of orbits on n-element subsets. So G is 
n-transitive (resp. n-set-transitive) if F,,(G) = 1 (resp. fn(G) = 1). If the 
group G is clear, we drop it and write simply F),, fn. 

The definition speaks of orbits on 2", the set of all n-tuples. Let F'*(G) 
denote the number of these orbits. It is clear that, for given n, one of these 
numbers is finite if and only if the others are; indeed, we have 


e f= » S(n,k) Fy, where S(n,k) is the Stirling number of the sec- 
k=1 
ond kind (the number of partitions of an n-set into k parts); 


e in Ey = Tee 


As an example for the first point, if G is highly transitive, then 
Fi(G) = S S(n,k) = B(n), 
k=0 


the nth Bell number (the number of partitions of an n-set). 
In the second point, both bounds are attained: 


e Let G = Sym(Q) (we will always denote this group by S$). Then 
Junio) = (5) = 1 for all x, 

e Let G be the group of order-preserving permutations of the rational 
numbers (we will always denote this group by A). We can map any 
n rational numbers in increasing order to any other n such by an 
order-preserving permutation (by filling in the gaps to produce a 
piecewise-linear map); so f,(A) = 1 and F,,(A) = nl. 


Highly set-transitive groups must resemble the above types. All we can 
do is to modify the total order slightly. The following is proved in [3]. 
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Theorem 3.1. Let G be a highly set-transitive but not highly transitive 
permutation group. Then there is a linear or circular order preserved by G. 
In particular, G is not 4-transitive. 


3.1.3. Topology 


We will only consider permutation groups of countable degree in this ar- 
ticle. If we are interested in the sequences f, and F,, this is justified by 
the following result, which follows from the Downward Lowenheim—Skolem 
Theorem of model theory: 


Proposition 3.1. Let G be an oligomorphic permutation group on an infi- 
nite set. Then there is an oligomorphic permutation group G’ on a countably 


infinite set such that F,,(G’) = F,(G) and fn(G’) = fn(G) for alln EN. 


There is a natural topology on the symmetric group of countable de- 
gree. This is the topology of pointwise convergence, where a sequence (g,) 
converges to g if ajgn = aig for all sufficiently large n, where (a1, Q2,...) is 
an enumeration of 2. This topology can be derived from a complete metric. 
The topology is specified by the first part of the proposition below. 


Proposition 3.2. 


(a) G is open in Sym(Q) if and only if it contains the pointwise stabiliser 
of a finite set. 

(b) G is closed in Q if and only if G is the automorphism group of a 
relational structure on Q, that is, a family of relations (of various 
arities) on Q. 


The closure of a permutation group G consists of all permutations which 
preserve the G-orbits on (” for all n. We remarked earlier that, as far as 
orbit-counting goes, we cannot distinguish between groups with the same 
orbits. The largest such group is necessarily closed in Sym(Q). So we may 
restrict our attention to closed groups of countable degree (that is, auto- 
morphism groups of countable relational structures) if we are interested in 
orbit-counting. 

The fact that the topology on the symmetric group is derived from a 
complete metric means that the same is true for any closed subgroup. This 
permits use of the Batre category theorem. Recall that a subset of a complete 
metric space is residual if it contains the intersection of countably many 
open dense subsets. The Baire category theorem asserts that a residual 
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set is non-empty. Indeed, residual sets are “large” (for example, they have 
non-empty intersection with any open set, and the intersection of countably 
many residual sets is residual). Often it is possible to give a non-constructive 
existence proof for some object by showing that objects of the required type 
form a residual set. 


3.1.4. Cycle index 


An important tool in combinatorial enumeration is the cycle index of a 
finite permutation group G, which is the polynomial in the indeterminates 
$1, 82,---,Sn (where n is the — given by 


LAGs 845 $5, 603584) = iG =e gt c1(9) g62() , ; - gon (9) 


gEG 


where c;(g) is the number of i-cycles in the decomposition of the permuta- 
tion g into disjoint cycles. 

For its role in enumeration, see [18]. 

One cannot simply take this definition unchanged for infinite permu- 
tation groups, for several reasons: the number c;(g) may be infinite; there 
may be infinite cycles; and the denominator |G| is infinite. 

The trick to generalising it lies in the following result about finite groups, 
the Shift Lemma. Let PQ be the set of all subsets of the finite set Q, and let 
PQ/G denote a set of representatives of the G-orbits on PQ. Also, if X is 
any subset of Q, we let G[X] denote the permutation group on X induced 
by its setwise stabiliser in G. 


Proposition 3.3. For a finite permutation group G on Q, 


>> Z(G[X]} 51, 82,...) = Z(G 81 +: 1,82 +1,...). 
XEPQ/G 


Now let G be any oligomorphic permutation group on the (possibly 
infinite) set 2. We define the modified cycle index of G to be the left-hand 
side of the Shift Lemma, with one small modification: we replace PQ by 
Pan, the set of all finite subsets of 2. Thus, the modified cycle index is 


Z(G; 81, $2,...) = >, ZG X |s8i5.835¢2<) 


X EP gn Q/G 


Each term in the sum is the cycle index of a finite permutation group. 
Also since G is oligomorphic, there are only a finite number of terms in 
the sum which contribute to the coefficient of a fixed monomial s{' -- - s¢ 


a 
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namely those corresponding to the f,(G) orbits on sets of cardinality n = 
C1 + 2cg +---+1rc,. (We see here that the definition would fail if G were 
not oligomorphic.) So we have defined a formal power series in 51, S2,.... 
We also see that if G happens to be a finite permutation group, then we 
have the ordinary cycle index with 1 added to each indeterminate. 

Our previous counting functions can be extracted from the modified 
cycle index: 


Proposition 3.4. Let G be an oligomorphic permutation group. 


e Fa(x) is obtained from Z(G) by substituting s; = x and s; = 0 for 
o> Le 
e fa(x) is obtained from Z(G) by substituting s; = x* for all i. 


Just as before, it is true that 


e for any oligomorphic permutation group G, there is an oligomorphic 
group G’ of countable degree satisfying Z(G) = Z(G’); 

e an oligomorphic group of countable degree and its closure have the 
same modified cycle index. 


So we may consider closed groups of countable degree. 

We conclude this section by displaying the modified cycle index for 
the groups S (the infinite symmetric group) and A (the group of order- 
preserving permutations of Q). 


Proposition 3.5. 


3.2. Connections 


Oligomorphic permutation groups are closely connected with two other ar- 
eas of mathematics: model theory, and combinatorial enumeration. In this 
section we discuss the connections. 
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3.2.1. Model theory 


Model theory describes structures consisting of a set with a collection of 
constants, relations and functions. Much of mathematics can be fitted into 
this framework, often in different ways. For example, a group has a binary 
operation (composition), a unary operation (inversion), and a constant (the 
identity); but the second and third may be defined in terms of the first. A 
graph can be regarded as a set of vertices with a binary relation (adjacency), 
or as a set of vertices and edges with a unary relation (to distinguish the 
vertices) and a binary operation of incidence. The latter is appropriate for 
multigraphs. 

Logic describes such structures by means of collections of formulae. The 
language includes symbols for the relations, functions, and constants, con- 
nectives and quantifiers, equality, brackets, and a supply of variables. We 
work in first-order logic: we are allowed to combine finitely many formulae 
with connectives, and to quantify over variables which range over the un- 
derlying set. I will assume that the language is countable. A sentence is a 
formula with no free variables (all variables are quantified). Thus for exam- 
ple the group axioms can be expressed as a single sentence (the conjunction 
of the associative, identity and inverse laws, each universally quantified over 
all variables). 

A theory is a set of sentences; a structure is a model for a theory if every 
sentence in the theory is true in the model. 

Sometimes we want a theory to have a wide range of models (this is the 
case with group theory). At other times, we have a particular model in mind, 
and want to capture as much of its essence as possible in a theory (this is 
the case with the Peano axioms for the natural numbers). It is known that, 
as long as a theory has infinite models, it cannot have a unique model; there 
will be models of arbitrarily large cardinality. The best we can do is ask 
that the theory is a-categorical, where a is an infinite cardinal, meaning 
that there is a unique model of cardinality a@ up to isomorphism. By a 
theorem of Vaught, there are only two types of categoricity, countable and 
uncountable: if a theory is a-categorical for some uncountable cardinal a, 
then it is a-categorical for all such. 

Uncountable categoricity gives rise to a powerful structure theory, ex- 
tending that for vector spaces over a fixed field or algebraically closed 
fields of fixed characteristic (where a single invariant, the rank, deter- 
mines the model). By contrast, countable categoricity is related to sym- 
metry, by the following remarkable theorem due independently to Engeler, 
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Ryll-Nardzewski and Svenonius: 


Theorem 3.2. Let M be a countable model of a theory T over a countable 
language. Then T is Xo-categorical if and only if the automorphism group 
of M is oligomorphic. 


In fact, if M is the unique countable model of T, and G = Aut(M), 
then F(T) is the number of n-types over T (maximal consistent sets of 
formulae in n free variables). So our results are applicable to counting types 
in such theories. 

Note in passing that the automorphism group of N is trivial; so the 
Peano axioms have countable “non-standard” models. 


3.2.2. Combinatorial enumeration 


The set-up is similar, but we restrict ourselves to relational structures (no 
function or constant symbols). We will see that the counting problems for 
orbits on sets and tuples of oligomorphic permutation groups are identi- 
cal with those for unlabelled and labelled structures in so-called oligomor- 
phic Fraissé classes of relational structures. These include large numbers of 
combinatorially important classes of structures, so we have a rather general 
paradigm for interesting counting problems. 

Take a fixed relational language L (this means, each set carries named 
relations of prescribed arities). For example, we could consider graphs, di- 
rected graphs, tournaments, partially ordered sets, k-edge-coloured graphs, 
graphs with a fixed bipartition, or much more recondite examples. 

A Fraissé class over L is a class C of finite relational structures over L 
satisfying the following four conditions: 


(a) C is closed under isomorphism; 

(b) C is closed under taking induced substructures (this means, take a 
subset of the domain, and all instances of all relations which are 
contained within this subset); 

(c) C has only countably many members up to isomorphism; 

(d) C has the amalgamation property: this means that, if B,, Bo € C 
have isomorphic substructures, they can be “glued together” along 
these substructures (or possibly more) to form a structure in C. 
Formally, if A,B,,Bz € C and f; : A — B; is an embedding for 
i = 1,2, then there is a structure C' € C and embeddings h; : B; — C 
for i= 1,2 so that hi f; = hafo. 
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(The reader is warned that I allow the structure A to be empty here. Some 
authors exclude this, state the special case where A = () separately, and 
call it the joint embedding property.) 

For a simple example, finite graphs form a Fraissé class: take the union 
of B, and By identifying the isomorphic substructures, and put any or no 
edges between B, \ A and Bz \ A. It is just a little more difficult to show 
that finite total or partial orders form a Fraissé class. 

Let M be a structure over L. We make two definitions: 


e The age of M is the class of all finite L-structures which are embed- 
dable in M. 

e M is homogeneous if any isomorphism between finite induced sub- 
structures of M/ can be extended to an automorphism of M. 


For example, the totally ordered set Q is homogeneous, and its age consists 
of all finite totally ordered sets. 
Fraissé’s Theorem asserts: 


Theorem 3.3. A class C of finite L-structures is the age of a countable 
homogeneous L-structure M if and only if it is a Fraissé class. If these 
conditions hold, then M is unique up to isomorphism. 


We call the countable homogeneous structure M the Fraissé limit of the 
class C. Thus, (Q, <) is the Fraissé limit of the class of finite totally ordered 
sets. The Fraissé limit of the class of finite graphs is the celebrated random 
graph, or Rado graph R, which is extensively discussed in [4]. 

Now suppose that a Fraissé class satisfies the following stronger version 
of condition (c): 


(c’) For any n €N, the class C has only finitely many n-element members 
up to isomorphism. 


We call such a class an oligomorphic Fraissé class. All examples mentioned 
so far are oligomorphic; the condition certainly holds if L contains only 
finitely many relations. Let M be its Fraissé limit, and let G = Aut(M) be 
the automorphism group of M. By homogeneity, it follows that 


e F,(G) is the number of labelled n-element structures in C (that is, 
structures on the set {1,2,...,n}; 

e fn(G) is the number of unlabelled n-element structures in C (that 
is, isomorphism classes of n-element structures). 
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Some Fraissé classes satisfy a stronger version of the amalgamation prop- 
erty, called strong amalgamation. This is said to hold if the amalgam of any 
two structures can be produced without identifying any points not in the 
common substructure. More formally, if hi(b1) = h2(b2), then b; = fi(a) 
and bo = fo(a) for some a € A. 

This holds in the above examples. An example where it fails is given 
by the class of graphs consisting of isolated vertices and edges. If B, and 
Bo are edges and A a single vertex, then in the amalgam we are forced to 
identify the other ends of the two edges as well. 


Proposition 3.6. A Fraissé class has the strong amalgamation property 
if and only if the automorphism group of its Fraissé limit has the property 
that the stabiliser of any finite set of points fixes no additional points. 


Later we will see a still stronger version. 


3.3. Constructions 


There are two main sources for examples of (closed) oligomorphic groups: 
building new examples from old, or constructing Fraissé classes (the group 
is then the automorphism group of the Fraissé limit of the class). 


3.3.1. Direct and wreath products 


Suppose that G; and G2 are oligomorphic permutation groups on 2; and 
Q2 respectively. We can form their direct or wreath product, and each of 
these has two actions, which we now discuss. 


Direct product, intransitive action. The direct product G = G, x G2 
acts on the disjoint union of Q; and Qe, say Q. An n-subset of 2 consists of 
k points of Q; and n—k points of Qe, for some k with 0 < k < n; two n-sets 
are in the same G-orbit if and only if their intersections with Q; are in the 
same G;-orbit for 7 = 1,2. Thus (f,(G)) is the convolution of (f(G1)) and 


(Ta (G2)): 
frn(@) = S> fe(G1) fn—#(Ga), 
k=0 


from which it follows that the generating functions multiply: 


fa(x) — ja, (@)fe, (x). 
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Similarly, for F,, we have an exponential convolution, so that the exponen- 
tial generating functions also multiply: 


Fo(x) = Fe, («) Fa, (2). 
The modified cycle index is also multiplicative: 


Z(G) = Z(G1)Z(G2). 


Direct product, product action. The direct product G; x G2 also has 
a product action on the cartesian product 2) x Q2. This is more difficult 
to analyse; the recent paper [9] describes the situation. 

First, there is a multiplicative property: it is easy to see that 


Fy. (G) = Fy (G1) Fr (G2). 


The modified cycle index of the product can be computed as follows. 
Define an operation on the indeterminates by 


§, eS; = (Stem(a,3) ea? - 


Extend this operation multiplicatively to monomials and then additively to 
sums of monomials. Then we have 


Z(G) = Z(G) e Z(G2). 


It should clearly be possible to deduce the first of these relations from 
the second; but this is surprisingly difficult (see the cited paper). 

Here is an example. Consider first the group G = A, the order-preserving 
permutations of Q. We have F;,(G) = n!, and hence 


F*(G) = 5 Gia, Kiel = Pa), 
k=1 


the number of preorders of an n-set (reflexive and transitive relations). 
Now let G = A x A with the product action. We have F*(G) = P(n)?, 
and so 


nm 


F,(G) = 55 s(n, k)P(k)?. 


k=1 


Moreover, since G (like A) has the property that the setwise stabiliser of 
an n-set fixes it pointwise, we have 
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Now the exponential generating function for P(n) is 1/(2 — e”), with the 
nearest singularity to the origin at log2; so P(n) is roughly n!/(log 2)”. 
So F,(G) is about (n!)?/(log 2)?”. Since the Stirling numbers alternate in 
sign, it is not completely trivial to find the asymptotics of f,(G). This was 
achieved in [10], using three entirely different methods; it turns out that 
n! —1 (Joe 2)? 1 
fn(G) ~ 7° 5 (log 2) (ogy? 


In broad-brush terms, “factorial times exponential” . 


Wreath product, imprimitive action. The wreath product G,;WrG2 
has its imprimitive action on 24) x Q2; as we have seen, this should be 
thought of as a covering of Q2 with fibres isomorphic to Q,. The function 
F(a) is found by substitution: 
Fe(z) = em gen (x) ~ 1). 

In particular, for any oligomorphic group G, we have 

Fswra(x) = Fe(e” — 1) = FG(2), 
so that F,,(SWrG) = F*(G) for all n, as is easily seen directly. 

For another example, we note that Fewrs(x) = exp(fe¢(x) — 1). Thus, 
the substitution rule for the wreath product can be regarded as the proto- 
type of a wide generalisation of the exponential principle in combinatorics 
(cf. [16, 33]). If G is the automorphism group of the Fraissé limit of a Fraissé 
class C, then GWrS is similarly associated with the class of disjoint unions 
of C-structures; the exponential principle applies to this situation. In gen- 
eral, if H is associated with a Fraissé class D, then GWrH is associated 
with the class of disjoint unions of C-structures with a D-structure on the 
set of parts. For example, if H = A, then we have ordered sequences of C- 
structures; if H is the automorphism group of the random graph, we have 
a graph whose vertices are C-structures; and so on. 

The numbers f,,(G) of orbits on unordered sets cannot be obtained from 
the sequences (f,(Gi)) and fn(G2)) alone; we need the modified cycle index 
of the top group G2. The generating function fg¢(x) is obtained from Z (G2) 
by substituting fg, (a’) —1 for the variable s;, for i =1,2,.... 


Wreath product, power action. The wreath product also has a power 
action on OQ, the set of functions from Q2 to Q; (the set of transver- 
sals to the fibres in the imprimitive action). This action is not in general 
oligomorphic; it is so if G2 is a finite group. 
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It is shown in [9] that F*(G) is obtained from the (ordinary) cycle index 
of the finite group G2 by substituting F*(G)) for all of the variables s;. 


3.3.2. Other examples 


Further examples are most easily described as automorphism groups of 
relational structures. As we have seen, any closed oligomorphic group of 
countable degree is the automorphism group of a homogeneous relational 
structure, which is the Fraissé limit of a Fraissé class of finite structures. 
Sometimes the easiest way to specify the group is to give the Fraissé class. 


The random graph and some of its relations. Let R be the countable 
random graph. This graph is the Fraissé limit of the class of finite graphs. 
There are direct constructions for it, see [4]: for example, the vertices are 
the primes congruent to 1 mod 4, and p and gq are joined if and only if p 
is a quadratic residue mod q. Alternatively, as Erdos and Rényi showed, if 
we form a countable random graph by choosing edges independently with 
probability 1/2 from the 2-subsets of a countable set, the resulting graph 
is isomorphic to R with probability 1. 

The graph R is homogeneous, and contains all finite (and indeed all 
countable) graphs as induced subgraphs. So, if G = Aut(R), then F,,(G) 
is the number of labelled graphs on n vertices (which is 2”("~!)/?), while 
fn(G) is the number of unlabelled graphs on n vertices (which is asymptoti- 
cally F,(G)/n!). Note that these sequences grow so fast that the generating 
functions converge only at the origin. 

There are several related Fraissé classes. Here are a few examples. 


e In place of graphs, we can take directed graphs, or oriented graphs, 
or tournaments, or k-uniform hypergraphs. Note that the auto- 
morphism group of the random k-uniform hypergraph is (k — 1)- 
transitive but not k-transitive; so every degree of transitivity is pos- 
sible for infinite permutation groups. 

e In [34] it is shown that there are only five closed supergroups of 
Aut(R), namely 


— Aut(R); 
— The group D(R) of dualities (automorphisms and _anti- 


automorphisms) of R, which is 2-transitive and contains 
Aut(R) as a normal subgroup of index 2; 
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— The group S(R) of switching automorphisms of R (see below), 
which is 2-transitive; 

— The group B(R) of switching automorphisms and_anti- 
automorphisms of R, which is 3-transitive and contains $(R) 
as a normal subgroup of index 2; 

— the symmetric group. 


e In [24] the countable homogeneous graphs were determined. There 
are some trivial ones (disjoint unions of complete graphs of the same 
size, and their complements); some non-trivial ones, the Henson 
graphs and their complements; and the random graph. The Hen- 
son graph H,, is the Fraissé limit of the class of all finite graphs 
which contain no complete graph of order n, for n > 3. They are 
not very well understood. 


The operation ox of switching a graph with respect to a set X of vertices 
consists in replacing edges between X and its complement by non-edges, 
and non-edges by edges, while keeping things inside or outside X the same 
as before. A permutation g is a switching automorphism of T if [9 = ox (T) 
for some set X. Switching anti-automorphisms are defined similarly. 

One curious fact is that F,,(S(R)) and f,(S(R)) are equal to the num- 
bers of labelled and unlabelled even graphs on n vertices (graphs with all 
vertex degrees even), although the even graphs do not form a Fraissé class. 


Ordered structures. An old theorem of Skolem says that, if two subsets 
X and Y of Q are dense and have dense complements, then there is an order- 
preserving permutation of Q carrying X to Y. More generally, consider 
colourings of Q with m colours so that each colour class is dense. It is not 
hard to show that there is a unique such colouring up to order-preserving 
permutations. The automorpism group A,, of such a colouring (the group 
of permutations preserving the order and the colours) is oligomorphic, and 
fn(Am) = m”. For, if {ci,...,¢m} is the set of colours, then an n-set 
{1,---5Qn}, with q1 < +++ < qm, can be described by a word of length n 
in the alphabet {c1,...,¢m}, whose ith letter is the colour of q;; two sets 
lie in the same orbit if and only if they are coded by the same word, and 
every word arises as the code of some subset. 

We can modify this example in the same way we did for Q itself, allowing 
ourselves to preserve or reverse the order, or turning it into a circular order. 
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Treelike structures. There are vast numbers of treelike structures; I 
cannot give even a brief overview, so I will concentrate on a couple of 
examples. 

Consider the class of boron trees, that is, finite trees in which each 
vertex has degree 1 or 3. Consider any four leaves in such a tree. They are 
connected by a graph of the following shape: 


We see that one of the three partitions of the four leaves into two sets of two 
is distinguished by the fact that the paths joining vertices in the same pair 
do not intersect. Thus, there is a quaternary relation on the set of leaves. 
It is possible to show that this relation uniquely determines the boron tree, 
and that the class of finite structures carrying quaternary relations which 
arise in this way is a Fraissé class. So there is a corresponding oligomorphic 
group G, which is 3-transitive but not 4-transitive, and satisfies f,(G) ~ 
An-5/2¢", where c = 2.483.... 

This construction can be modified in many ways, of which a few are 
given below. 


e We may consider trees with degrees 1,3,4,...,m, or with any possi- 
ble degree except 2. (Divalent vertices are invisible in this construc- 
tion.) 

e We may consider internal vertices as well as leaves: there will be a 
ternary betweenness relation saying that one vertex is on the path 
joining the other two. 

e By embedding the trees in the plane, we may impose a circular order 
on the set of leaves. 


Many of these constructions give examples with exponential growth 
(roughly c” for some c > 1). 


Other examples. The symmetric group S on a countable set 9 has an 
induced action on the set of k-element subsets of 2, for any k. These groups 
are oligomorphic, but except in the case k = 2, not much is known about 
the asymptotics of the orbit-counting sequences (see [11] for the case k = 2). 

Other examples are linear and affine groups on infinite-dimensional vec- 
tor spaces over finite fields. See [12] for the asymptotics. 
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3.4. Growth Rates 


In this section we survey some known results about the rate of growth of 
the sequences (f;,(G)) and (F;,(G)) for an oligomorphic group G. 

First, note that there is no upper bound for growth rates. Given any 
sequence (a,,) of natural numbers, let L be a relational language containing 
dy, n-ary relations C be the Fraissé class consisting of all structures in which 
an n-ary relation holds only if all its arguments are distinct. If G is the 
automorphism group of the Fraissé limit, then f,(G) > 2°” for all n. Indeed, 
it is slow growth which is most interesting! 

For intransitive or imprimitive groups, we can have polynomial growth 
for (fn). For a simple example, if G is the direct product of r symmetric 
groups (with the intransitive action), then f,(G) = (“r 4 ‘), with 
generating function f¢(x) = (1—2)~”. 

For the same group, F;,(G) = r", since the orbit of an n-tuple is de- 
termined by the orbit containing each of its points. Recently it has been 
shown (cf. [1]) that, if G is transitive, and the point stabiliser has m orbits 
on the remaining points, then F;,(G) > m"~!. Equality is possible here for 
any m. 

There is a gap above polynomial growth for (f,). The next possible 
growth rate is fractional exponential, about exp(./n). This is realised by 
the group G = SWrS, for which we have f,(G) = p(n), the number of 
partitions of the integer n, with growth asymptotically 


Ta Perv 2M) 


and F;,(G) = B(n), the number of partitions of an n-set (the Bell number), 
whose growth is faster than exponential but slower than factorial. 

However, if we insist that the group is primitive, there is a dramatic 
change ([25, 29]): 


Theorem 3.4. There is a number c > 1 such that the following holds. 
Suppose that G is primitive but not highly set-transitive. Then 


e fn(G) > c"/q(n), for some polynomial q; 
e F,(G) > n!c"/q(n), for some polynomial q. 


In other words, for (f;,), there is a gap between constant and exponential 
growth! Merola’s proof gives the result with c = 1.324...; the best-known 
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examples have c = 2 (these are circular versions of the rationals partitioned 
into two dense subsets, that we met earlier). 
The main problems for exponential growth are: 


e prove that the “exponential constant” limy—.oo(fn)!/" always exists; 

e find the possible values it can take; 

e find a structural description of the examples where the growth is no 
faster than exponential. 


What happens just above exponential growth? Here are some examples. 


e Let C be the Fraissé class each of whose members is a set with a pair 
of total orders, and G the automorphism group of its Fraissé limit. 
Then f,(G) =n! and F,(G) = (n!)?. 

e The group SWrS2 with the power action is primitive. I do not know 
what the asymptotic behaviour of (f,(G)) is. As explained earlier, 
we find that 


F,(G) ~ 7 B(n)(B(n) + 1), 


where B(n) is the Bell number. 
e For the permutation group induced by S on the set of 2-element 
subsets of the domain, we have 


F,(G) ~ B(2n)2-"n7"/? exp (—[J log(2n/logn)}*) . 


No clear evidence of a gap emerges from this limited data. In [26] there 
are some theorems connecting growth just faster than exponential with 
model-theoretic properties such as stability and the strict order property. 

For the automorphism group of the random graph, the growth rate is 
about exp(cn?). For such growth, it doesn’t make a lot of difference whether 
we consider F;, or f,. We make one brief observation about this case. 


Proposition 3.7. Let G be the automorphism group of a homogeneous 
structure over a finite relational language. Then F,,(G) is bounded above by 
the exponential of a polynomial. 


The converse of this is not true. For example, the general linear group 
on a vector space of countable dimension over the field of two elements has 
F),(G) roughly gn’/ 16 but is not the automorphism group of a homogeneous 
structure over a finite relational language. (Take two n-tuples of vectors, 
the first linearly independent and the second satisfying the single linear 
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relation that the sum of the vectors is zero. These two n-tuples cannot be 
distinguished by relations of arity less than n.) 

An open problem here is to understand what the implications for G are 
of being the automorphism group of a homogeneous structure over a finite 
relational language. 


3.5. Graded Algebras 


Another part of mathematics where sequences of positive integers occur is 
the theory of graded algebras. Such an algebra is a direct sum 


A=@Q@A4n. 


neEN 


where the A, are vector spaces over a field F’, and, if v € Am and w € 
Am, then the algebra product vw belongs to A;,4+n. The subspaces A, are 
the homogeneous components. If they are all finite-dimensional, then the 
sequence of their dimensions (or its ordinary generating function) is the 
Hilbert series of the algebra. If a graded algebra is finitely generated, then 
the dimensions grow no faster than a polynomial in n, and so the Hilbert 
series converges inside the unit circle. 

We construct graded algebras as follows. First, let 2 be an infinite set, 
and F a field (for our purposes, always of characteristic zero). Let A, be 
ee (the set of n-element subsets of 2) 
to F’, and define a multiplication on the homogeneous components by the 
rule 


the vector space of functions from ( 


fo(X)= SO F(V)g(X\Y) 
ye(*) 

for f € An, g € Am, and X € ‘Ar Extended linearly to A = Q An, 
this multiplication is commutative and associative, and makes A a graded 
algebra. The constant function 1 on (°) = {} is the identity. (In fact 
Ag is 1-dimensional, spanned by the identity.) This algebra is the reduced 
incidence algebra of the poset of finite subsets of 2, but we make no use of 
this fact. 

The algebra A is much too large: its homogeneous components have 
infinite dimension (except for Ag which has dimension 1), and there are 
many nilpotent elements. 

But it has one important feature: if e € A, is the constant function on 
Q with value 1, then e is not a zero-divisor; that is, multiplication by e is a 
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monomorphism from A,, to An+1. (This algebraic statement is really a fact 
about finite combinatorics: the incidence matrix of n-sets and (n + 1)-sets 
of a set X of cardinality at least 2n + 1 has full rank.) 

Now let G be a permutation group on 2. There is a natural action of 
G on An, for all n; we let AG be the set of functions invariant under G 
(that is, constant on the G-orbits), and Af = @ AG. Then A is a graded 
subalgebra of A. Moreover, if G is oligomorphic, then dim(A®) = f,(G), 
so that the Hilbert series of A& is fa(x). 

There are various interesting examples of groups for which A® is finitely 
generated. For example, if G = G1 x Go in its intransitive action on Q) UQa2, 
where G; acts on Q; for i = 1,2, we have 


AS = AW @p A%, 


for any field of characteristic zero. In particular, A°** is freely generated 
by (that is, a polynomial ring in) two generators of degree 1, and more 
generally, A®’ freely generated by r generators of degree 1. 

If H is a finite permutation group of degree n, and G = SWrdH (in 
its imprimitive action), then A® is isomorphic to the ring of invariants of 
H (acting as a linear group via permutation matrices). In particular, if 
H = Sj, then A@ is the ring of symmetric polynomials in n variables, and 
is freely generated by the elementary symmetric polynomials e1,...,¢€n of 
degrees 1,2,...,n, by Newton’s Theorem. 

However, it follows from Macpherson’s Theorem that, if G is primitive 
but not highly homogeneous, then A% cannot be finitely generated. 

In the light of this, we are forced to look for other kinds of algebraic 
properties of A®. An example is a remarkable recent theorem of [30], con- 
firming a 30-year-old conjecture: 


Theorem 3.5. Let G be a permutation group on an infinite set Q. Then 
A® is an integral domain if and only if G has no finite orbits on Q. 


One direction is trivial. Suppose that A is a G-orbit of size n, and let 
f € An be the characteristic function of {A}. Then f € AG and f? = 0. 
The converse requires a new type of Ramsey-type theorem which is likely 
to have further applications. The result itself is applicable to growth rates, 
in view of the following result by [6], whose proof uses some easy dimension 
theory from algebraic geometry: 


Theorem 3.6. If G is oligomorphic and A® is an integral domain then 
tuae = tun + fn — 1. 
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Another situation in which we can describe the structure of A@ is when 
G is associated with a Fraissé class C (that is, G is the automorphism group 
of the Fraissé limit of C). In [5] it is shown that, if the class C has notions of 
connected components, disjoint union, and spanning substructures satisfy- 
ing a few simple axioms, then A® is a polynomial algebra whose generators 
are the characteristic functions of the connected structures in C. (More pre- 
cisely, for each isomorphism type of n-element connected structure in C, let 
f be the function in A, which takes the value 1 on n-sets whose induced 
substructure is of the given type and 0 elsewhere; the collection of all such 
functions is a set of free generators for A.) Here are a few, reasonably 
typical, examples. 


e Let C be the class of finite graphs, so that G is the automorphism 
group of the random graph R. Then a basis for the nth homogeneous 
component of A© consists the characteristic functions of all n-vertex 
eraphs (so that dim(A@) = f,,(G) is the number of unlabelled graphs 
on n vertices). Now the characteristic functions of connected graphs 
on n vertices are algebraically independent, and the set of all such 
elements forms a free generating set for A@, which is a polynomial 
algebra (infinitely generated). 

e We saw earlier that, if G is associated with a Fraissé class C, then 
GWTrS is associated with the class of disjoint unions of C-structures. 
There is an obvious notion of connectedness: structures with a sin- 


AGWFS is a polynomial algebra, with 


AGWrs 


gle part are connected! Thus, 
fn(G) generators of degree n for all n. (Thus, the structure of 
is independent of the structure of A? except for numerical informa- 
tion about dimensions of components.) 

e Consider the set Q with two kinds of structure: the order, and m 
subsets forming a partition of Q, each one dense in Q. (Think of a 
colouring of Q with m colours c1,¢2,...,¢m, so that each colour 
class is dense.) Let G be the group of permutations preserving 
the order and the colours. An orbit of G on n-sets is described 
uniquely by a word of length n in the alphabet {ci,c2,...,¢m}, 
recording the colours of the elements of the set in increasing order. 
Now the algebra A® is the shuffle algebra over an alphabet of size 
m, which occurs in the theory of free Lie algebras (cf. [32]). One 
can develop an appropriate notion of connectedness, so that the 
connected words are the so-called Lyndon words. The fact that the 
shuffle algebra is a polynomial algebra generated by the Lyndon 
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words was first proved in [31], but emerges as a special case of the 
theory presented here. 


3.6. Group Structure 


Relatively little is known about the structure of closed oligomorphic per- 
mutation groups, but the picture is rapidly changing. We look first at the 
normal structure in a couple of classical examples. 

The countable symmetric group S = Sym(Q) has a normal subgroup 
FSym(Q), the finitary symmetric group, consisting of all permutations moy- 
ing only finitely many points; this has a normal subgroup of index 2, the 
alternating group Alt(Q), consisting of finitary permutations which are even 
permutations of their supports. These are the only non-trivial proper nor- 
mal subgroups; in particular, the quotients are simple. 

The group A of order-preserving permutations of Q has two normal 
subgroups L and R; L consists of all permutations fixing all sufficiently 
large positive rationals, and R consists of permutations fixing all sufficiently 
large negative rationals. Their intersection consists of the order-preserving 
permutations of bounded support. These are the only non-trivial normal 
subgroups. In particular, A/(D R) = L/(LOR) x R/(LOR), and the two 
factors are isomorphic. 

The automorphism group of the countable random graph is simple (cf. 
[35]). Indeed, given any two elements g,h of this group with g ¥ 1, it is 
possible to write h as the product of three copies of g or g7!. 

An important property which has had a lot of attention is the small 
index property. Let G be a permutation group of countable degree. A sub- 
group H has small index if |G: H| < 2®°. (If the Continuum Hypothesis 
holds, this just says that H has finite or countable index.) The stabiliser of 
any finite set has small index. We say that G has the small index property 
if any subgroup of G of small index contains the pointwise stabiliser of a 
finite set; it has the strong small index property if every subgroup of small 
index lies between the pointwise and setwise stabilisers of a finite set. If 
G is a closed oligomorphic group with the small index property, then the 
topology of G is determined by the group structure: a subgroup is open 
if and only if it has small index, so the subgroups of small index form a 
neighbourhood basis of the identity. 

The symmetric group S, the group A of order-preserving permutations 
of Q, and the automorphism group of the random graph all have the strong 
small index property ((8, 15, 19]). A typical example of a permutation group 
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which does not have the strong small index property is SWrS, in its imprim- 
itive action; the stabiliser of a block of imprimitivity is not contained in the 
setwise stabiliser of any finite set. Examples which do not have the small 
index property can be constructed by producing automorphism groups of 
Fraissé limits which have infinite elementary abelian 2-groups as quotients; 
they have “too many” subgroups of small index! 

I mentioned earlier that the notion of Baire category is important for 
closed oligomorphic groups. Such a group G is said to have generics if there 
is a conjugacy class which is residual in G. In each of the three groups 
mentioned above, generics exist. Indeed, the nth direct power of the group 
has generics (in other words, the original group has generic n-tuples) for all 
n. This property is closely related to the small index property (see [19]). 

We can also ask what group is generated by a “typical” n-tuple of el- 
ements. John Dixon proved that almost all pairs of elements of the finite 
symmetric group S, generate S,, or the alternating group A,,. Later [13], 
he proved an analogue for the symmetric group of countable degree: almost 
all pairs of elements (in the sense of Baire category, that is, a residual set) 
generate a highly transitive free subgroup of S. (The existence of highly 
transitive free groups was first shown in [28].) 

As noted, very recently these results have been extended to wider classes 
of groups. 

Macpherson and Tent define the free amalgamation property, which is 
a strengthening of the strong amalgamation property, as follows. A Fraissé 
class C has the free amalgamation property if, whenever B, and Bo are 
structures in C with a common substructure A, there is an amalgam C of 
B, and Bez such that 


e the intersection of B, and Bp, in C is precisely A (this is the content 
of strong amalgamation, see Section 3.2); 
e Every instance of a relation in C is contained in either B, or Bo. 


This holds for graphs: we can choose to make the amalgam so that there 
are no edges between B, \ A and By \ A. 
They prove the following theorem: 


Theorem 3.7. Let C be a nontrivial Fraissé class (that is, there are some 
non-trivial relations) with the free amalgamation property, and G the auto- 
morphism group of its Fraissé limit. Then G is simple. 


The trivial case must be excluded, since then G is the symmetric group 
S, and as we have seen this group is not simple. 
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A Polish group (a topological group whose topology comes from a com- 


plete metric) is almost free if a residual subset of the n-tuples of elements 


of G freely generate a free group. Now [17] gives a number of equivalent 


characterisations of such groups, and show: 


Theorem 3.8. Closed oligomorphic groups are almost free. 


[23] give conditions for the existence of generic conjugacy classes in a 


closed oligomorphic group. 


In closing I mention a couple of important papers linking oligomorphic 


permutation groups, dynamical systems, and Ramsey theory: [20, 22]. 
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4.1. Introduction 


Mathematicians began to investigate the intrinsic complexity of subsets 
of the real line and of more general spaces about a century ago. Souslin’s 
discovery that the projection on the line of a Borel subset of the plane is not 
always Borel was an early but major success, and his separation theorem 
as well. When put together, his results say that a set which is defined with 
an existence quantifier on a real number (4 x € R such that ...) cannot in 
general be defined only with a sequence of quantifier on natural numbers, 
although it is so if this set can also be defined with a universal quantifier 
on a real variable (V « € R,...), in which case the real numbers can be 
eliminated from the definition. 

In the above, speaking of “real numbers” is nothing but a convenience 
and what really matters is to know whether our set can be defined from, 
say, closed sets by iterating countable intersections and unions countably 
many times or if we need to index certain operations with an uncountable 
set (which is usually metrizable, separable and complete, so that we keep 
some control on what goes on). 

In real life, all the sets we actually meet are measurable, but it must be 
stressed that quite often they are not Borel, and we will actually display in 
this survey many classes of separable Banach spaces which are not Borel. 

What we mean when speaking of a “Borel class” of Banach spaces has 
been made clear in Bossard’s work [3] (motivated by the seminal work [4]) 
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where it was shown for instance that the relation of linear isomorphism be- 
tween Banach spaces is complicated and that no decent set contains exactly 
one space in each isomorphism class (see Theorem 4.2 below). This work 
opened the way to several applications of descriptive set theory to Banach 
spaces, in particular through the use of (transfinite) uniform boundedness 
principles. In the last few years, the work of S. Argyros, P. Dodos and of co- 
authors deepened the theory with the discovery of “amalgamation” meth- 
ods which tighten the links (and stress the analogy) between set-theoretical 
and linear operations. These new techniques provide the right approach to 
universality problems, by showing that the classes SE(Y ) of isomorphic sub- 
spaces of a given Banach space Y are the generic hereditary and analytic 
classes of Banach spaces. 

Let us now outline the contents of this paper. Section 4.2 is a survey of 

the classical theory of analytic sets. The gist of the results from this section 
is that knowing that a tree T C w<“ of integers is not well-founded gives no 
way of finding the infinite branch which testifies that T is ill-founded. Sec- 
tion 4.3 displays Bossard’s frame, which allows a measure of the complexity 
of given classes of Banach spaces. The main results of this section explain 
the “non-algorithmic” nature of the isomorphic theory of Banach spaces. 
Section 4.4 presents some natural coanalytic ranks such as the Szlenk index 
(and related indices) on the coanalytic non-Borel class of spaces with sepa- 
rable dual, and motivations for their use. Finally, Section 4.5 introduces the 
recent “converse statements” where analytic collections of separable spaces 
are “glued together” (with a tree space) for being embedded into the set of 
subspaces of a “small” Banach space. The theory is far from complete and 
several open problems are mentioned in the text. 
Notation: We denote w = {0,1,2,3,...} the set of natural numbers. If 
E is a set, E* (respectively, ES“) is the set of sequences (respectively of 
finite sequences) of elements of FE. In particular, w” is the set of sequences 
of integers, and w<“ is the set of finite sequences of integers. If s € w<”, we 
denote |s| the length of s and t < s means that |t| < |s| and that s begins 
with t. We denote as well s < o if o € w” begins with s € w<”. The Cantor 
set 2” of subsets of w is denoted A. An ordinal is identified with the set of 
its predecessors. In particular, we denote w, the set of countable ordinals. 
If A and B are sets, we define 7; = A x B — A by 11(a,b) = a. The vector 
space of functions u : A — R which are 0 except for finitely many a € A 
is denoted co9(A). Other pieces of notation are classical or will be defined 
before use. 
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4.2. A Short Survey on Analytic Sets 


The space w” of sequences of integers is a metric complete separable space 
- in other words, a Polish space when equipped with the distance 


d(o,o') =} 7 2 6(o(i),0'(i)) 
tew 
where 6(n,k) = 0 ifn =k and 1 if not. It is easy to check that given any 


Polish space P, there exists a continuous onto map S': w” — P. 
The following definition is due to Souslin and goes back to 1917. 


Definition 4.1. A metric space WM is analytic if there exists a continuous 
onto map S:w” — M. 


It follows immediately from the definition that the class of analytic sets 
is stable under continuous images. 


Lemma 4.1. Let P be a Polish space, and A C P be a subset of P. The 
following are equivalent: 

(i) A is analytic. 

(it) There exists a closed subset F C P x w” such that A= 7 (F). 


Proof. (ii) = (i): F is closed in the Polish space P x w” hence it is Polish. 
Hence F is analytic, and A as well since it is a continuous image of F’. 

(i) > (ii): Let S = w” — P be a continuous map such that S(w”) = A. 
Since S' is continuous, the set 


F = {(S(o),0) € Px w”; c Ew} 
is closed in P x w”, and A= 71 (F). O 


We now observe that the space w” enjoys remarkable stability proper- 
ties. Indeed w” x w ~ w” and (w”)” ~ w”. This implies: 


Lemma 4.2. The class of analytic subsets of a given Polish space P is 
stable under countable unions and countable intersections. 


Proof. Let A, = 71(F,) with F,, a closed subset of P x w’. The set 
F = {(2,0,n) € Px w” xw; (2,0) € Fr} 
is a closed subset of P x w’ x w, and thus 


LJ An = ™(F) 


new 
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is analytic. On the other hand the set 
G={(2,(en)n) © P x (@*)”} Gon) € FB, for all n} 
is closed in P x (w”)”, and thus 
() An = 71 (G) 
new 
is analytic. L 


Since closed subsets of P are clearly analytic, Lemma 4.2 implies that 
every Borel subset of P is analytic, and thus one has: 


Lemma 4.3. If P, and P2 are Polish spaces and B C P, x P2 is a Borel 
subset of Py x P2 then m2(S) is analytic. 


This lemma provides a substitute to an erroneous statement of 
Lebesgue, who had claimed that the projection on a line of a Borel subset 
of the plane is Borel. This claim was actually the starting point of Souslin’s 
work, who showed that although such a projection is in general not Borel, 
it retains some properties of Borel sets such as universal measurability. 

We now recall the definition of a tree on integers: this is a subset T of 
w<” such that if s € T andt < s, then t € T. The boundary [T] of such 
a tree T is the set of all o € w® such that o), € T for all n € w. In other 
words, [7] consists of all the “infinite branches” of T. Every boundary is a 
closed subset of w”, and all closed subsets can be obtained in this way. 

In view of Definition 4.1, it is therefore not surprising that trees show 
up in this context. Indeed let 


A = 7 (F’) 
with F closed in P x w”. For s € w<”, let 
V, = {o €w"; s<o}. 


Let d be a distance which defines the topology of P, and for x € P and 
d > 0, let 


B(a, 6) = {y € P;d(az,y) < 6}. 
We define a tree T(x) as follows: 
T (2) = {s € w<”; (B(a, |s|“') x Vz) NF F 9}. 
It is easily seen that 
(z,0)E F soe [T(a)| 
and thus x € A = 7 (F) if and only if [T(a)] # ¢. 
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We now recall a classical piece of notation: a tree T is well-founded if 
[T| = ¢. With this notation, x € A if and only if T(x) is not well-founded. 

Let T C 2(**) be the set of all trees on integers (including the tree {¢}). 
This is a closed subset of ges") and thus a compact set. Let WF C T be 
the subset consisting of well-founded trees. This subset W F' is a transfinite 
union of “constituents” defined as follows: let w,; = {a;a < w;} be the set 
of countable ordinals. Given T' € 7, we define 


T’ ={s Ew"; there isn €w with s ne T} 


In other words, T’ consists of all s € T which can be extended to a longer 
sequence in T’. We define then by transfinite induction 


Tlatt) = creny 
and for 7 a limit ordinal 

TB) — () T(o). 

a< 
It is clear that if T ¢ WF, then for all a < w; and allo € [T] 
bs ={s<o} CT. 

Conversely, if s € T‘ for all a < wy, there is n € w such that s~n € T 
for all a, and it follows that if 


(| TO #¢ 


a<wy 


then T ¢ WF. If T € WF, we denote by h(T) the height of T, that is, the 
smallest a < w 1 such that T(~) = ¢. If T ¢ WF, we denote h(T) = w. 
Returning to A = 7) (Ff) an analytic subset of P, we define 


r: Pw ,U {wy} 
by r(x) = h(T(x)). By the above, A = r—'({wi}) and 
C=P\A= [J rl({\;A <a}). 


This map r is called a rank. Such a set C = P\A, whose complement is 
analytic, is called a coanalytic set and if we let 


Cua" “ek Zeal) 
we have C = (J Cg, and C appears as a transfinite increasing union of 


aewy, 
“constituents” . 
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The following combinatorial result on trees is an abstract version of 
Souslin’s separation theorem. We state it without proof. 


Theorem 4.1. For any countable ordinal a < w1, the set 
Bu={t 67 h{T) =< ae} 


is a Borel subset of T. Moreover, if A is an analytic subset of WF CT, 
then 

sup h(T) < wy. 

TEA 

It is an instructive exercise to show that for every a < w , there exists 
T € WF such that h(T) = a and then, Theorem 4.1 implies that WF is 
not analytic, and that T\WF is analytic but not Borel. 

If now A = 7(F) is an analytic set, the map r(x) = h(T(a)) defined 
above yields to the following applications of Theorem 4.1: the coanalytic set 
C = P\A isa transfinite union of the Borel sets Cy. = r~'(Ba). If Ai is an 
analytic subset of C,, there is a < w; such that A, C Cg. Hence two disjoint 
analytic sets can be separated by a Borel set: A; C Cg and A C P\Cyq. 

So far, we displayed the basic facts of Souslin’s theory within the frame 
of metric spaces. However, most of this can be done in standard Borel 
spaces, that is, in sets X equipped with a o-field B such that (X,B) is 
Borel-isomorphic with some Polish space equipped with its Borel o-field. In 
other words, (X,8) is a standard Borel space if there is a Polish distance 
on X whose Borel o-field coincide with 6. However this distance is not 
canonical and it plays little role. In order to introduce a “Borel way” to 
describe analytic sets, we return to the above definition of the trees T(z) 
and with the same notation, we observe that for all s € w<”, the set 


U, = {x € P;(B(z,|s|~+) x Vs) NF 4 $} 
is open in P, and 
A= |) [) Ue 
cow’ s<o 


This is called the Souslin operation, and it is usually denoted 
A= 5((Us)). 


This shows that every analytic set can be obtained from Borel sets {B,;s € 
w<“} through the Souslin operation S. Conversely, the class of analytic sets 
is stable under S: if the sets {A,;s € w<”} are analytic, so is S((As)). 
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We can therefore define a subset A of a standard Borel space (X, B) to 
be analytic if there exist {B,;s € w<”} C B with 


A=S((Bs)). 


The above theory (rank, constituents, separation, ...) applies and no 
reference is needed to a Polish distance on X. If (X,B) and (Y,B’) are 
standard Borel spaces, and F : X — Y is a Borel map, then F(X) is an 
analytic subset of Y, and every analytic set is obtained in this way. We 
refer to [6], [26], [31] for much more on Souslin theory and descriptive set 
theory and to [2] for some connections between this theory and geometry 
of Banach spaces. 


4.3. Bossard’s Coding of Separable Banach Spaces 


This section is devoted to a general frame (published in [3]) which makes it 
possible to define Borel, analytic or coanalytic families of separable Banach 
spaces. 

In order to do so, we need a proper parametrization of the collection of 
separable Banach spaces, which turns it into a set. 

If P is a Polish space, we denote F(P) the set of closed subsets of P. 
Let P be a metrizable compactification of P; with the Hausdorff distance, 
the set F (P) is compact metric, and the map F — F maps F(P) to the 


A A 


subset Fo(P) of F(P) defined by 
Fo(P) ={F CP; F =FNP}. 


It is easily checked that Fo(P) is a Gs subset of F(P) and thus it is home- 
omorphic to a Polish space. The corresponding Borel structure on F(P) 
does not depend upon the compactification, and it is called the Effros-Borel 
structure on F(P). It is generated by the sets 


By ={F €F(P);F NU £ $} 


where U is an arbitrary open subset of P. 

We denote by A = 2” the Cantor set. It is well-known that every sep- 
arable Banach space is isometric to a subspace of C(A) equipped with the 
supremum norm. 

Let SE(C(A)) be the set of all closed linear subspaces of C(A). It is 
easily checked that SE(C(A)) is a Borel subset of F(C(A)) equipped with 
the Effros-Borel structure. It is therefore a standard Borel space, as Borel 
subset of a standard Borel space. 
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For sake of shortness we now denote SE(C(A)) simply as SE. Let ~ by 
the equivalence relation of isomorphism: if (X,Y) € SE?, X ~ Y means 
that X is isomorphic to Y. 


Definition 4.2. A coding of separable Banach spaces up to isomorphism 
is an onto map from a set Z onto the set SE/ ~. The canonical coding is 
the quotient map 0: SE > SE/~. 


It could of course be objected at this stage that the “canonical” coding 
is not so canonical, since other parametrizations of Banach spaces could be 
chosen: for instance, any separable Banach space is a quotient of ¢;(w) and 
the kernels of such quotient map could be used. However, it is shown in [3] 
that such alternative parametrizations lead to the same complexity results. 

The space SE is a standard Borel space, but is there any chance to 
equip the quotient space SE// ~ with a usable structure of standard Borel 
space? The following result from [3] answers this question in the negative. 


Theorem 4.2. The relation ~ is analytic non-Borel; that is, the set 
IT={(X,Y)€ SE’; X~Y} 


is analytic non-Borel. There exists a space U € SE whose isomorphism 
class 


(UU) ={X € SE; X ~U} 
is analytic non-Borel. The relation ~ has no analytic section. 


Outline of Proof: 

This result relies on the fact that the set WF’ of well-bounded trees 
is coanalytic non Borel (see Theorem 4.1). We denote by U Pelczynski’s 
universal space (cf. [85]) which has a basis (wu, ) and contains complemented 
copies of every Banach space with a basis. Let || - || denote the norm on U. 
We now define a “tree space” as follows: for y € cop(w<“”), set 


5h. 1/2 
k 


Il villa==sup 4 |S) PSO y(s) moi 


j=0 ||s€l; 


where the supremum is taken over all k € w and over all admissible choices 
of intervals, namely all finite sets {I;;0 < j < k} of intervals w<” such that 
every branch of w<“” meets at most one of these intervals. 
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Let U(w<”) be the completion of cop(w<”) under this norm. If T € T 
is any subtree of w<”, let U(T’) be the space of all vectors from U(w<“”) 
supported by T or alternatively the completion of coo(T) under ||| -|| |2. 
Since WF is a non-Borel subset of 7, the fact that the isomorphism class 
(U) is non-Borel follows from the following three statements: 

(a) the map T'+> U(T) is Borel from T to SE. 

(b) If T € WF, U(T) is reflexive and thus U(T) ¢ (U). 

(c) If T ¢ WF, U(T) is isomorphic to U. 

Statement (a) is easy to check. Statement (b) follows from the fact that 
an (5-sum of reflexive spaces is reflexive through a transfinite induction. 
Statement (c) follows from the fact that if o € w’ and bs = {s < ao} 
is the corresponding branch, the space U(b,) is isomorphic to U. Now, if 
T ¢ WF, pick o such that 6, C T. Then U(b,) ~ U is a complemented 
(by restriction) subspace of U(T), and since U(T) has a basis it is isomor- 
phic to a complemented subspace of U. Now an application of Pelezynski’s 
decomposition scheme shows that U(T) ~ U. 

It remains to establish that the relation ~ (that is, the set I) is ana- 
lytic, and this follows quite easily from its definition by “there exists an 
isomorphism”. Since the intersection 


(U) = IN (SE x {U}) 


is non-Borel, the relation ~ is analytic non-Borel. Finally the existence of 
a non-Borel class shows that there is no analytic set A which meets every 
class in exactly one point. Indeed, if A were such a set let {X } = AN(U) and 
let A’ = A\{X}. Since A’ is analytic as well as ~, its saturation SE\(U) 
would be analytic. But since (U) is analytic, Souslin’s separation theorem 
would show that both are Borel, contradicting the fact that (U) is not. O 

The proof of Theorem 4.2 is typical of which one should expect in this 
context: given a tree T’,, one tries to construct a copy of U in U(T). This 
will be possible exactly when T ¢ WF. If T € WF but is very high, it will 
take “very long” to check that U(T) does not contain U, and this cannot 
be done in any “algorithmic” way. 

We note that condition (b) from the above proof shows that the class 
of reflexive spaces is coanalytic non-Borel. 

The above proof shows that the isomorphism class (U) of U is analytic 
non-Borel. On the other hand, Kwapien’s theorem (see [28]) easily implies 
that (¢2) is Borel. It is not known whether this characterizes ¢2: 
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Problem 4.1. Let X be a separable Banach space whose isomorphism class 
(X) is Borel. Is X isomorphic to l2? 


It follows from Bourgain’s work [5] on £, space that if X = L, with 
1 < p< oo, p # 2 then (X) is not Borel (see [5], Th. 4.35). The case of 
X =o is particularly interesting: it is not known (see [19], Remark 2 after 
the Theorem 5.6) whether co is up to isomorphism the only predual of 1; 
with a summable Szlenk index. Such a characerization of co would show 
that its isomorphism class is Borel. 

Clearly, the proof of Theorem 4.2 shows as well that the class of spaces 
which contain an isomorphic copy of U is analytic non-Borel. This is a 
special case of the following result from [3], whose proof follows similar 
lines. 


Theorem 4.3. Let Z be an infinite-dimensional separable Banach space. 
Then the class Cz of separable Banach spaces which contain an isomorphic 
copy of Z is analytic non-Borel. 


Let us mention that Theorem 4.3 remains true when “isomorphic” is 
replaced by “isometric”, in which case it even extends to every space Z of 
dimension greater than 1 (see [20]). 

The article [3] contains much more results similar to the above, whose 
common feature is to assert that the natural classes of Banach spaces are as 
complicated as they look at first sight. For instance, the class of separable 
spaces not containing ¢; with non-separable dual is the difference A;\A2 of 
two analytic sets and is not simpler. One natural question seems however to 
be left open. Before stating it, let us recall that the relation of embedding, 
in other words the set 


J={(X,Y) €SE*; ]Z CY with X ~ Z} 


is analytic (non-Borel), and then for every space Y, the class SE(Y ) of space 
which are isomorphic to subspaces of Y is analytic. Is this really optimal? 


Problem 4.2. Does there exist a separable space Y such that SE(Y) is 
non-Borel? 


Let us note that for some space Y, such as fg or C(A) it is actually 
Borel. The behaviour of the isomorphism relation ~ restricted to such 
classes SE(Y) is not well-understood despite important recent progress 
(see [15], [37], [14], [16]). Actually the following improvement of Theorem 
4.2 has been shown in [17]: the relation of isomorphism between separable 
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Banach spaces is a complete analytic equivalence relation. In other words, 
this isomorphism relation enjoys maximal complexity among all analytic 
equivalence relations. Thus, separable Banach spaces up to isomorphism 
provide complete invariants for essentially every isomorphism problem in 
analysis, with the by-product that classifying Banach spaces up to isomor- 
phism is definitively beyond reach. 

Gowers’ solution to the homogeneous space problem (see [21]) states 
that Y ~ @ if and only if {SE*(Y)/ ~} is a singleton, where SE* denotes 
the set of infinite dimensional closed subspaces. However the following prob- 
lem is still open. 


Problem 4.3. Let Y be an infinite-dimensional separable Banach space 
which is not isomorphic to 2. Does Y contain subspaces {Xy;n € w} such 
that X,, is not isomorphic to X, ifn #k? 


4.4. Coanalytic Ranks 


In Section 4.2 we discussed heights of trees and the corresponding rank 
function r associated with a representation A = 72(F’) of an analytic set as 
projection of a closed set. This yields to a general definition. 


Definition 4.3. Let P be a standard Borel space and C' be a coanalytic 
subset of P. A coanalytic rank o on C is a map 0 = P — uw U {wu} such 
that 

(i) C= {x2 € P; o(x) < wy} 

(ii) {(x, y) € C x P; a(x) < a(y)} is coanalytic in P?. 
(iii) {(z,y) € C x P; o(x) < o(y)} is coanalytic in P?. 


For every coanalytic set C’, there exists such a coanalytic rank, and the 
classical result below, which extends Theorem 4.1, states that this rank is 
quite canonically associated with C. 


Theorem 4.4. Let o be a coanalytic rank of the coanalytic set C. Then: 
(i) For every a < wy, the set By = {x € C30(x) < a} is Borel. 
(i) If ACC is analytic, there is a < w, such that AC By. 
(iit) If o is another coanalytic rank on C, there exists d : w, > wy such 
that if a(x) <a, then o'(x) < d(a). 


Condition (iii) above asserts that all coanalytic ranks on C' are some- 
what equivalent (a special case of the Kunen-Martin uniform boundedness 
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principle). Indeed the set of all a’s such that o~'({0,a[) = 0’~1([0, af) is a 
closed and cofinal subset of w4. 

Natural coanalytic ranks can be displayed on the coanalytic classes from 
section 4.3, such as an “embedding rank” on the class Gz = SE\Cz of 
Banach spaces which do not contain an isomorphic copy of a given space Z 
(cf. [3]). However the most interesting examples where (iii) can be applied, 
come from Cantor-Bendixon derivations. 

We now define this concept, and refer to [25] for its applications to 
harmonic analysis. Let kK be a metrizable compact space, and F(x) be the 
set of closed subsets of K equipped with the Hausdorff topology. A Borel 
derivation 


d: F(K) > F(K) 
is a Borel map such that 
(i) d(F) C F for all F. 
(ii) If F C G then d(F’) C d(G). 
Such a derivation can be iterated: if a < w 1, define 

dt) (F) = dd (F)) 
and for @ < w; a limit ordinal, 

d9) (F) = () d\) (F) 

a< 
let now 
oa(F) = min{a; d‘ (F) = ¢}. 


If such an ordinal exists, and w; otherwise. Then one has: 


Theorem 4.5. Let {d,;n € w} be a countable family of Borel derivations. 
Then the map X(F) = sup{oa,(F);n € w} is a coanalytic rank on the 
coanalytic set C= {F € F(K);04a,(F) < w1 for all n}. 


Note that the special case of a single derivation (when all d,,’s coincide) 
is very important. 

The first example of derivation, due to Cantor, is when d(F’) is the set 
of accumulation points of Ff’. We now define a similar concept: Let F' be a 
w*-compact subset of the space 4, = (7 and € > 0. We define 


d.(F') = {x € F;||- || —diam (V1 F) > € for all V} 
where V runs through the set of all w*-open sets containing x, and 


6(F) = {xe F,||-|| -— diam (HF) >. for all H} 
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where H runs through the set of all w*-open half-spaces containing x. It is 
clear that d. and A, are derivations, and it can be checked that they are 
Borel on F(By,,). Let now 


o(F) = 04, (F) 


and 


By Theorem 4.5 the map 
o(F’) =sup{oi/z(F);k € w} 
is a coanalytic rank on the set 
C={F € F(Bx-«);0<(F) < w; for all € > O}. 
Although the obvious inclusion 
de(F’) € 5-(F) 


is in general strict, it follows from a result of [32] that C' coincide with the 
set of F’s for which o/(F’) < w, for all € > 0. Hence if 


o'(F) =sup{oi,(F), k ew} 


it follows from Theorems 4.4 (iii) and Theorem 4.5 that there exists a map 
@ : wy — w, such that o(F’) < a implies o’(F’) < ¢(a). Hence, although 
the derivation 6 which consists removing small slices is slower than d which 
allows the removal of small w*-open sets, it is not much slower. Actually, 
as noted after Theorem 4.4, there is a closed cofinal subset C of w, such 
that if a € C then o(F) < a if and only if o/(F) <a. 

This theory yields to natural indices defined on separable Banach spaces. 
Let X be separable. Since there exists Q : 0; — X a quotient map, the dual 
unit ball Bx» embeds (in w* and norm topologies) into By,,, and we can 
define 


oo? 


Sz(X) = o(Bx«) 
Dz(X) = o'(Bx«). 


These indices are called the Szlenk index (following [39] where it is defined 
in another equivalent way) and the dentability index (see e.g. [19]). It is 
easily seen that these indices do not depend upon the choice of the quotient 
map Q, and that Sz (respectively, Dz) can be defined by removing from 
Bx those w*-open sets (respectively, w*-slices) which are small in norm. 
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The frame provided by By, shows however that Dz(X) < ~(Sz(X)), where 
wy =w,U{w,} — w,U{w 1} is a universal map such that ~(w,) = w,. Indeed 
Sz(X) < uw, if and only if Dz(X) < w, if and only if X* is separable. 
A separable reduction argument shows that w actually works for every 
Asplund space (see [29]). 

The first value of the map 7 is known: if Sz(X) = w, then Dz(X) < w? 
and this estimate is optimal ((22]). Actually, Dz(X) = w if and only if 
X is super-reflexive, and if Sz(X) = w and X is not super-reflexive then 
Dz(X) = w?. Note that a third index, denoted C'z(X) in [19], can be defined 
through the removal of w*-open slices which can be covered by a union of 
small w*-open sets. Equivalently, the convex Szlenk index C'z(X) is defined 
by the derivations 


* 


AAP) =conv” (d.(F)). 
It is clear that 
Sax) < C2aX) <= Dz(X). 


It is shown in [22] that Dz(X) < w’ -Cz(X). It is known that the Szlenk 
index of any space with separable dual has the form w® for some a < w}. 
The values of ~(w%) for a > 2 are not known. The existence of a cofinal 
subset $” of w; such that if a € S’ and Sz(X) =a then Dz(X) = a follows 
from an application of the classical “pressing-down” lemma (see [30], Prop. 
6.4). 

There are several motivations for studying these indices. First of all, 
they are invariants for linear isomorphisms and this sometimes lead to clas- 
sification results: for instance, if AK and L are countable compact spaces, 
then C(K) ~ C(L) if and only if Sz(C(K)) = Sz(C(L)). Even more, the 
condition Sz(X) = w is equivalent to Cz(X) = w and is invariant under 
uniform homeomorphism ([19]) and in particular under Lipchitz isomor- 
phism. 

Also, they happen to be useful in renorming theory. This follows from 
the fact that the derivations 6, and A, maps convex sets to convex sets, and 
convex combinations of distances from these convex sets yield to dual norms 
with good convexity properties and thus to smooth equivalent norms. We 
provide two examples: using the inequality Cz(X) < y(Sz(X)) for all X 
with separable dual, it is shown in [29] that every space Y such that 


sup{Sz(X); X separable, X CY} <u 


has an equivalent Fréchet-smooth norm. Using the quantitative behaviour 
of d. and A,, it is shown in [19] that Sz(X) = w if and only if X has 
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an asymptotically uniformly smooth equivalent norm (a result previously 
shown in [27] with weaker quantitative estimates). 

We refer to [12], [10] and [11] for more applications of Szlenk derivations 
to Lipschitz isomorphisms, to [18] and [30] for surveys on the Szlenk index, 
and to the Chapter 2 of [23] for a presentation of the whole theory. 

To make a link with section 4.3, let us mention that the Szlenk index 
Sz(X) and the dentability index Dz(X) are coanalytic ranks on the coana- 
lytic subset of S'E consisting of spaces with a separable dual ({3], Th. 4.11). 
However, it is still not known if the index r(X) = sup(Sz(X), Sz(X*)) is 
a coanalytic rank on the coanalytic set of reflexive spaces, although the 
corresponding index on the set of bases of reflexive spaces is known to be 
a coanalytic rank ([3], Cor. 5.5). 

We will see in the next section how to use such ranks for showing the 
existence of certain universal spaces. 


4.5. A New Direction: The Converse Statements 


The main argument in the proof of Theorem 4.2 is that the “tree space” 
U(w<”) satisfies the following conditions: if o € w’ and b, = {s < o} is 
the corresponding branch, then U(b,) ~ U while if T’ € WF the space 
U(T) is not isomorphic to U (and is actually reflexive). Under one form or 
another, similar arguments are frequently used for proving that a collection 
of Banach spaces is analytic non Borel. 

A remarkable progress due to S$. Argyros and P. Dodos is that such 
constructions are actually generic. Indeed they showed in [1] that a usable 
analogue of Definition 4.1 is available for analytic families of Banach spaces 
with Schauder bases. 

We need some terminology. If A is a non-empty set, a pruned tree on /A 
is a subset T of A<“ such that for every s € T, there is s < 0 € A” such 
that bs = {s <o} CT. We denote [T] = {0 € A”; bg CT}. 

With this notation, let us call a Schauder tree space on T a Banach 
space X equipped with a collection {2,;¢ € T} of normalized vectors such 
that 
(i) X = span{az;t € t} 

(ii) For every o € [T], the sequence {x,,,,;n > 1} is a bi-monotone Schauder 
basic sequence. 

We can now state the following fundamental lemma (see [1], Prop. 83): 


Lemma 4.4. Let A C SE be an analytic set of Banach spaces, such that 
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every Y € A has a Schauder basis. Then there exist a pruned tree T on w 
and a Schauder tree space X on T such that 
(1) For every Y € A, there is o € [T] such that 


YaXk,= SPaN{ Lo),,; 7 > 1. 
(2) For every o € [T], there is Y € A such that Xo ~Y. 


This lemma turns out to be a very powerful tool for constructing “small” 
universal spaces for the analytic set A. For instance, given a Schauder tree 
space X, let us consider like in the proof of Theorem 4.2 (using some ideas 
from [24]) its “C2 Baire sum” T3* defined by the norm 


a 
2 2 
£ 


ella = sup ¢ | $0 |] > 208) & 


j=0 ||tES; x 


where the supremum is taken over all choices of intervals such that every 
branch b, meets at most one of them. It is clear that for all o € [T], X, 
is isometric to the subspace of T;* of vectors supported by b,. And on 
the other hand, T;* is “close enough” to the family A, in the sense where 
some control is available on subspaces of T;* which would not belong to A. 
This construction, with several technical refinements (since for using [1], 
one needs to construct spaces with bases containing the elements in A, in 
a controlled way from the Effros-Borel point of view) allows to show that 
several natural classes of Banach spaces are strongly bounded, according to 
the following definition. 


Definition 4.4. A subset C of SE is said to be strongly bounded if for 
every analytic subset A of C, there exists Y € C such that A is contained 
in the set SE(Y) of spaces which are isomorphic to a subspace of Y. 


Several natural coanalytic classes of separable Banach spaces have been 
shown to be strongly bounded (and others are conjectured to be). To make 
a link with the result shown in section 4.2 asserting that coanalytic sets C 


can be written 
G= |} Gy 


aw, 
with C, Borel sets, we note that strongly bounded coanalytic classes C can 
be written 


C= |) S8Rz) 


aw, 
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and thus a transfinite family (Ya)a<w, of spaces from C suffices to exhaust 
the class. Let us now provide examples. 


Theorem 4.6. ([8]): The following classes are strongly bounded: 
1) The class R of reflexive spaces. 
2) The class SD of spaces with a separable dual. 


It follows from 1) that there exists a reflexive space Z such that X € 
SE(Z) for every uniformly convex space X (a result from [33]). Indeed the 
existence of an equivalent uniformly convex norm (in other words, super- 
reflexivity) is a Borel condition. 

As said before, it is not known if the functional 


r(X) =sup(Sz(X), Sz(X"*)) 


is a coanalytic rank on R (a question from [3]). However, it is known ({34]) 
that for every a < wy, there is a reflexive space Z, such that S&(Z,) 
contains all reflexive spaces X such that Sz(X) < a and Sz(X*) < a. Note 
that it is also proved in [34], using the universality results shown there, 
that the class C, of reflexive spaces X such that r(X) < a is analytic for 
every countable ordinal a. Hence the main result of [34] appears to be a 
consequence of Theorem 4.6. So far however, no direct proof is known of 
the fact that C, is analytic. 

On the class SD, the Szlenk index Sz(-) is a coanalytic rank. Hence 2) 
shows that for every a < wj, there is Yy such that Y;* is separable and 
X € SE(Y,) if Sz(X) < a. This answers positively a question from [37]. 


Theorem 4.7. ({7]): The class of non-universal spaces is strongly bounded. 


Actually, P. Dodos shows the existence, for every \ > 1, of a transfinite 
family (Yo)a<w, Of non-universal £..,, spaces, such that 


Geay = LJ SE(¥a) 


aw, 


where Gea) denotes the collection of spaces which do not contain an iso- 
morphic copy of C(A); in other words, the collection of non-universal spaces. 
The coanalytic set Gera) is equipped with a natural coanalytic rank 
obtained from a tree (see [3], Th. 4.4), and this is also true for the set Gy 
where Y is infinite-dimensional but arbitrary. However, it is not known if 
every class Gy is strongly bounded; in particular, it is not known for Gg,. 
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Note that there is no space Z € Gy such that Gy = SE(Z). Indeed, 
SE(Z) is analytic while Gy is coanalytic non Borel ([3]). Hence, every class 
Gy is hereditary with no universal space. 


Theorem 4.8. ([9]): The class of unconditionally saturated Banach spaces 
is strongly bounded. 


In this statement, “X unconditionally saturated” means that every 
subspace of X contains an unconditional basic sequence, or equivalently 
(by [21]) that X does not contain an hereditary indecomposable (in short, 
H.I) subspace. Hence if we decide once and for all to live in a universe from 
where H.I. spaces are banned (in other words, if we restrict our attention to 
unconditionally saturated Banach spaces) then the above theorem asserts 
that strong boundedness holds in our world. 
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In a recent paper, the authors have constructed a large class of operators 
in the Cowen-Douglas class of the unit disc D which are homogeneous 
with respect to the action of the group Mob — the Mobius group consist- 
ing of bi-holomorphic automorphisms of the unit disc D. The associated 
representation for each of these operators is multiplicity free. Here we give 
a different independent construction of all homogeneous operators in the 
Cowen-Douglas class with multiplicity free associated representation and 
verify that they are exactly the examples constructed previously. 


The homogeneous operators form a class of bounded operators JT’ on a 
Hilbert space H. The operator T' is said to be homogeneous if its spectrum 
is contained in the closed unit disc and for every Mobius transformation g 
the operator g(T’), defined via the usual holomorphic functional calculus, 
is unitarily equivalent to TJ’. To every homogeneous irreducible operator T 
there corresponds an associated unitary representation 7 of the universal 
covering group G of the Mobius group G: 


m(9)* T m(g) = (p9) (T), G € G, 


where p : G —> G is the natural homomorphism. In the paper [6] (see 
also [3]), it was shown that each homogeneous operator T, not necessarily 
irreducible, in B,,+1(D) admits an associated representation. The repre- 
sentations of G are quite well-known, but we are still far from a com- 
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plete description of the homogeneous operators. In the recent paper [6], the 
following theorem was proved. 


Theorem 5.1. For any positive real number X > m/2, m € N and an 
(m+1)-tuple of positive reals pu = (Mo, [1,---;m) with po = 1, there exists 
a reproducing kernel KO) on the unit disc such that the adjoint of the 
multiplication operator MO) on the corresponding Hilbert space AO) (D) 
is homogeneous. The operators (M))* are in the Cowen-Douglas class 
Bm+1(D), irreducible and mutually inequivalent. 


In the paper [6], we have presented the operators M) in as elemen- 
tary a way as possible, but this presentation hides the natural ways in which 
these operators can be found to begin with. Here we will describe another 
independent construction of the operators MO), We will also give an ex- 
position of some of the fundamental background material. Finally, we will 
prove that if T is an irreducible homogeneous operator in B,,+1(D) whose 
associated representation is multiplicity free then, up to equivalence, T’ is 
the adjoint of of the multiplication operator MO) for some ’ > m /2 and 
> 0. 


5.1. Background Material 


Although, we intend to discuss homogeneous operators in the Cowen- 
Douglas class B,,(D), the material below is presented in somewhat greater 
generality. Here we discuss commuting tuples of operators in the Cowen- 
Douglas class B,(D) for some bounded open connected set D C C™. The 
unitary equivalence class of a commuting tuple in B,(D) is in one to one 
correspondence with a certain class of holomorphic Hermitian vector bun- 
dles (hHvb) on D (cf. [4]). These are distinguished by the property, among 
others, that the Hermitian structure on the fibre at w € D is induced by 
a reproducing kernel Kk. It is shown in [4] that the corresponding operator 
can be realized as the adjoint of the commuting tuple multiplication oper- 
ator M on the Hilbert space H of holomorphic functions with reproducing 
kernel K. 

Start with a Hilbert space H of C”-valued holomorphic functions on a 
bounded open connected set D C C™. Assume that the Hilbert space 1 
contains the set of vector valued polynomials and that these form a dense 
subset in 7H. We also assume that there is a reproducing kernel K for 7H. 
We use the notation K,,(z) := K(z,w). 
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Recall that a positive definite kernel K:D x D — C”*"” on D defines an 
inner product on the linear span of {Kw,(-)€ : w € D,€ € C”} C Hol(D, C”) 
by the rule 


(Band JE, Bal) = (oat @ye, nN); E,7 EC”. 


(On the right hand side (,) denotes the inner product of C”. We denote 
by €1,...,€n the natural basis of C”.) The completion of this subspace is 
then a Hilbert space of holomorphic functions on D (cf. [1]) in which the 
linear span of the set of vectors {Kw(-)€ : w € D} is dense. The kernel K 
has the reproducing property, that is, 


(f, Kw€) = (f(w),€), fe H, we D, €eC™. 
Now, for 1 <i < m, we have 
Mj Kw€ = Wi Kwé, w € D, where (Mi f)(z) = zif(z), f EH 


and {Kw»é;}7_, is a basis for NZ, ker(M; — w;)*, w € D. 

The joint kernel of the commuting m-tuple M* = (M7y,..., M*,), which 
we assume to be bounded, then has dimension n. The map o; : wr Kywé, 
wéeD,1<i<n, provides a trivialization of the corresponding bundle E 
of Cowen-Douglas (cf. [4]). Here D := {z € C™ | z € D}). 

On the other hand, suppose we start with an abstract Hilbert space 
H and a m-tuple of commuting operators T = (71,...,7Zm) in the Cowen- 
Douglas class B,(D). Then we have a holomorphic Hermitian vector bundle 
E over D with the fibre F,, = N?_, ker(T; — w;) at w € D. Following [4], 
one associates to this a reproducing kernel Hilbert space H consisting of 
holomorphic functions on D as follows. Take a holomorphic trivialization 
o,:D— H with o;(w), 1 <i <n, spanning E,,. For f € H, define fj (w) — 
(f,0;(W))n, w € D. Set Ge Iq = (f.g)n. The function Kye; := a; (Ww) 
then serves as the reproducing kernel for the Hilbert space H. Note that 


(Kw (z)€j,€:)en = (Kwej, Kees) q 
= (5;(w), o:(2)) 
= (0;(W), 0:(Z)) Hn, z,w € D. 


If one applies this construction to the case where 1 is a Hilbert space 
of holomorphic functions on D, possesses a reproducing kernel, say K, and 


the operator M* is in B,(D) then using the trivialization o;(w) = Kwéi, 
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w €D for the bundle E defined on D, the reproducing kernel for H is 
(Kw(z)ej,€:)em = (Kwej, Kze;) 0 


= (05 (W), 1(2)) 


= (Kyé;, K2€i)9, Z,w € D. 
Thus H = H. 

Let G be a Lie group acting transitively on the domain D C C?. Let 
GL(n,C) denote the set of non-singular n x n matrices over the complex 
field C. We start with a multiplier J, that is, a smooth family of holomorphic 


maps J, :D — C"*” satisfying the cocycle relation 
Jgn(z) = Jn(z)Jg(h-z), forall 9 hEG, zED, (5.1) 


Let Hol(D,C”) be the linear space consisting of all holomorphic functions 
on D taking values in C”. We then obtain a natural (left) action U of the 
group G on Hol(D, C”): 


(Ugf)(z) = Jg-1(z) f(g" +z), f € Hol(D,C”), ze D. (5.2) 


Let K C G be the compact subgroup which is the stabilizer of 0. For h, k 
in K, we have Jpn (0) = Jn (0) J, (0) so that k + J;,(0)~! is a representation 
of K on C”. 

As in [6], we say that if a reproducing kernel K transforms according 
to the rule 


J(9,2)K(9(Z), 9(w)) J (g,w)* = K(z,w) (5.3) 
for all g € G; z,w € D, then K is quasi-invariant. 
Proposition 5.1 ([6], Proposition 2.1). Suppose H has a reproducing 


kernel K. Then U defined by (5.2) is a unitary representation if and only 
if K is quasi-invariant. 


Let gz be an element of G which maps 0 to z, that is g, -0 = z. 
For quasi-invariant K we have 


K (gz -0,92° 0) = (Jo, (0))~'K(0,0)(Jg. (0)*)~?, (5.4) 


which shows that A(z, z) is uniquely determined by K (0,0). For each z in 
D, the positive definite matrix K(z, z) gives the Hermitian structure of our 
vector bundle. 

Given any positive definite matrix A(0,0) such that 


J,.(0)~'K (0,0) = K(0,0)J,(0)* for allk € K, (5.5) 
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that is, the inner product (K (0,0): | -) is invariant under J;(0), (5.4) defines 
a Hermitian structure on the homogeneous vector bundle determined by 
J(z). In fact, A(z, z), for any z € D is well defined, because if g? is another 
element of G such that gi -0 = z then gi = g-k for some k € K. Hence 


K(g,-0,g, -0) = K(g-k - 0,g-k - 0) 
= (Jy ae (0, 0)( a : 
= (Jk (0) Jo. (#0) K (0,0) (Jp, (& - 0)" Je (0)") 
= (Jq.(0))*(Je(0))~ “ 0) C0)" "We, OF) 
= (Jg,(0))~*K (0, 0)(J,,(0)*)-* 
= Kg 0g,°0). 


This gives a good overview of all the Hermitian structures of a homo- 
geneous holomorphic vector bundle. But not all such bundles arise from a 
reproducing kernel. Starting with a positive matrix satisfying (5.5), (5.4) 
gives us K(z,z), but there is no guarantee (and is false in general) that 
K(z, z) extends to a positive definite kernel on D x D. It is, however, true 
that if there is such an extension then it is uniquely determined by K(z, z) 


= 


(because K(z,w) is holomorphic in z and antiholomorphic in w). 

This leaves us with the following possible strategy for finding the ho- 
mogeneous operators in the Cowen-Douglas class. Find all multipliers, (i.e., 
holomorphic homogeneous vector bundles (hhvb)) such that there exists 
K (0,0) satisfying (5.5) and consider all such K(0,0). Then determine which 
of the K(z,z) obtained form (5.4) extends to a positive definite kernel on 
DxD. Then check if the multiplication operator is well-defined and bounded 
on the corresponding Hilbert space. 

Let 1 be a Hilbert space consisting of C”-valued holomorphic functions 
on some domain DP possessing a reproducing kernel K. The sections of 
the corresponding holomorphic Hermitian vector bundle defined on D have 
many different realizations. The connection between two of these is given by 
an Xn invertible matrix valued holomorphic function y on D. For f € H, 
consider the map Ty : f 1» f, where f(z) = y(z) f(z). Let H = {f : f € H}. 
The requirement that the map Dy, is unitary, prescribes a Hilbert space 
structure for the function space H. The reproducing kernel for H is easily 
calculated 


K(z,w) = p(z)K(z, w)y(w)”*. (5.6) 


It is also easy to verify that [T,MV% is the multiplication operator 
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M: i > Zz f on the Hilbert space H. Suppose we have a unitary representa- 
tion U given by a multiplier J acting on 1 according to (5.2). Transplanting 
this action to H under the isometry I'y, it becomes 


(Uy f) (2) = Jg(z)F (9-2), 


where the new multiplier J is given in terms of the original multiplier J by 


Jaz) = p(z)Jg(z)e(g - le 


Of course, now K transforms according to (5.3), with the aid of J. If we 
want, we can now ensure that, by passing from H to an appropriate H, 
K(z,0) = 1. We merely have to set y(z) = K(0,0)!/?K(z,0)~!. Thus the 
reproducing kernel K is almost unique. The only freedom left is to multiply 
y(z) by a constant unitary n x n matrix. Once the kernel is normalized, we 
have 


JRlZ) = JehO), zE€D, keK. 
In fact, 
I = K(z,0) = J,(z)K(k- z,0)J,(0)* = Je (z) Jn (0) 


and the statement follows. Therefore, once the kernel K is normalized, we 
have 


(Ux-1f)(z) = Jn(0) f(k-z), kEK. 


Given a multiplier J, there is always the following method for construct- 
ing a Hilbert space with a quasi-invariant Kernel K transforming according 
to (5.4). We look for a functional Hilbert space possessing this property 
among the weighted L? spaces of holomorphic functions on D. The norm 
on such a space is 


Wie [se t(2 (2)aV(2) (5.7) 


with some positive matrix valued function Q(z). Clearly, this Hilbert space 
possesses a reproducing kernel kK. The condition that U,-1 in (5.2) is uni- 
tary is 


i f(g 2) JE(2)Q(2)Jg(2F(g- 2)dV (z =f f(w)"Q(w) f(w)AV (w) 
D 


a(g- 2) | 
A(z) 


= 7. fg-2)'Q(9-2)f (9-2) dV (2), 
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—2 


O(g- 2) (5.8) 


Q(9-2) = Ile)" Qe)Iy(e) | 


which is equation (5.3) with J,(z) replaced by AG) Jy(z)*-1. 

Given the multiplier Jj(z), Q(z) is again determined by Q = Q(0), and 
(just as in the case of K (0,0) = A) it must be a positive matrix commuting 
with all J,(0), & € K. (It is assumed that each J;,(0) is unitary). 

In this way, we can construct many examples of homogeneous operators 
in B,(D) but not all. 

Even, not all the the homogeneous operators in B;(D) come from this 
construction. There is a homogeneous operator in the class B;(D) corre- 
sponding to the multiplier J(g,z) = (g'(z))*, A € R exactly when A > 0. 
The reproducing kernel is K(z,w) = (1 — zw)~?’. But such an operator 
arises from the construction outlined above only if A > 1/2. 

Never the less, the homogeneous operators constructed in the manner 
described above are of interest since they happen to be exactly the subnor- 
mal homogeneous operators in this class (cf. [2]). 


5.2. Computation of the Multipliers for the Unit Disc 


In the case of B,,(D), it is shown in [6] that the bundle corresponding to 
a homogeneous Cowen-Douglas operator admits an action of the covering 
group G of the group G = Mob via unitary bundle maps. This suggests the 
strategy of first finding all the homogeneous holomorphic Hermitian vector 
bundles (a problem easily solved by known methods) and then determining 
which of these correspond to an operator in the Cowen-Douglas class. 

We are going to use the method of holomorphic induction. For this, first 
we describe some basic facts and fix our notation. We follow the notation 
of [7] which we will use as a reference. 


The Lie algebra g of G is spanned by X; = $ a ") x= 4 (; » 


0 
Xo. In the complexified Lie algebra g©, we mostly use the complex basis 


and Y = $ (; ): The subalgebra € corresponding to K is spanned by 
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h,x,y given by 


We write G© for the (simply connected group) SL(2,C). Let Go = 
SU (1,1) be the subgroup corresponding to g. The group G© has the closed 


subgroups K© = LG 4 :Z€ C,2 40}, Pes ic ) sé ech, —_—— 


{() *) :ZE c}; the corresponding Lie algebras © = {(6 °) :CE c}, 


a {(. ‘) :ce€ ch, i {(" ) :CE c} are spanned by h, x and 
0 0 c 0 

: t 04a€C,bEC} isa 

closed subgroup to be denoted T’; its Lie algebra is t = Ch+ Cy. The prod- 

uct set P+K©P- = P*tT is dense open in G®, contains G, and the product 

decomposition of each of its elements is unique. (G°/T is the Riemann 


y, respectively. The product K°P~ = { ( 


sphere, gK — gT, (g € G) is the natural embedding of D into it.) 

According to holomorphic induction ({5, Chap. 13]) the isomorphism 
classes of homogeneous holomorphic vector bundles are in one to one cor- 
respondence with equivalence classes of linear representations 9 of the pair 
(t, Rk). Since K is connected, here this means just the representations of t. 
Such a representation is completely determined by the two linear transfor- 
mations o(h) and o(y) which satisfy the bracket relation of h and y, that 
is, 


[o(h), o(y)] = —o(y). (5.9) 


The G-invariant Hermitian structures on the homogeneous holomorphic 
vector bundle (making it into a homogeneous holomorphic Hermitian vector 
bundle), if they exist, are given by o(K)-invariant inner products on the 
representation space. An inner product is o(K)-invariant if and only if o(h) 
is diagonal with real diagonal elements in an appropriate basis. 

We will be interested only in bundles with a Hermitian structure. So, 
we will assume without restricting generality, that the representation space 


of g is C4 and that o(h) is a real diagonal matrix. 
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Furthermore, we will be interested only in irreducible homogeneous 
holomorphic Hermitian vector bundles, this corresponds to @ not being 
the orthogonal direct sum of non-trivial representations. Suppose we have 
such a 9; we write Vy for the eigenspace of e(h) with eigenvalue a. Let 
—n be the largest eigenvalue of o(h) and m be the largest integer such 
that —n,-(n + 1),...,-(7 + m) are all eigenvalues. From (5.9) we have 
eNVe G Vat this and orthogonality of the eigenspaces imply that 
Y = Bio V_(nts) and its orthocomplement are invariant under 0. So, 
V is the whole space, and have proved that the eigenvalues of o(h) are 
—n,..-,-(n +m). 

From this it is clear that @ can be written as the tensor product of the 
one dimensional representation o given by o(h) = —n, o(y) = 0, and the 
representation @° given by @°(h) = o(h) + nl, 0°(y) = o(y). Correspond- 
ingly, the bundle for @ is the tensor product of a line bundle L, and the 
bundle corresponding to @°. 

The representation 0° has the great advantage that it lifts to a holo- 
morphic representation of the group 7’. It follows that the homogeneous 
holomorphic vector bundle it determines for D, G , can be obtained as the re- 
striction to D of the homogeneous holomorphic vector bundle over G/T ob- 
tained by ordinary induction in the complex analytic category. So, (as a con- 


l 
venient choice) take the local holomorphic cross section z +> s(z) := ( 0 ) 


of G°/T over D. In the trivialization given by s(z), the multiplier then 


appears for g = (° ’) EG as 


Jy (2) = 0° (s(z)~*g7*s(g - z)) 
9 fra 0 
“e ( —c ee! 


— 0° (exp (= i!) ) 0° (exp(2log(cz +.d)h)). (5.10) 


The last two equalities are simple computations. 
For the line bundle L,,, the multiplier is g’(z)” (we write g’(z) = Ba Z(z)). 
Consequently, the multiplier corresponding to the original g is 


IZ = (9'(z)) "JO (z). (5.11) 
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5.3. Conditions Imposed by the Reproducing Kernel 


We now assume that we have a homogeneous holomorphic vector bundle 
induced by @ as in the preceding sections and that it has a reproducing 
kernel. Then we derive conditions about the action of G that follow from 
this hypothesis. In the final section, we will show that these conditions 
are sufficient: they lead directly to the construction of all homogeneous 
operators the Cowen-Douglas class with multiplicity free representations. 

Under our hypothesis there is a Hilbert space structure on our sections 
in which the action of G given by (5.4) is unitary. We will study this repre- 
sentation through its K-types (i.e., its restriction to K). We first compute 
the infinitesimal representation. 

For X € g, and holomorphic f, we have 


(Ux f)(2) = (Gi), 9 Uowen N 


7 (5) 7 (se) Bon fespt-#x). 2)}. (5.12) 


There is a local action of G°, so this formula remains meaningful also 
for X ¢€ g©. There are three factors to differentiate. For the last one, 
(a) paod (exp(-tX) -z) = —(Xz)f'(z), and we see that exp(tz)-z = 


L % ; ‘na ; 2 
01)" z=2z+t gives x- z= 1; by similar computations, y-z = —2, 


h-z = z. For the first factor, we interchange the differentiations and get 
—n& (X -Z), i.e., 0, 2nz, —n, respectively for x,y and h. 

To differentiate the factor in the middle, we use its expression (5.10). 
First for X = y, we have 


5 =f 0° (exp(—t(tz + 1)~*y)) = “ 7 (exp(—t(tz fie 1)~0°(y)) 
=—2'y) (5.13) 
and 
di|,_o o° (exp(2 log(tz + 1)h)) = = = exp(2 log(tz + 1)0°(h)) 


2z0°(h). (5.14) 
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From these, following the conventions of [7] in defining H, E, F, it follows 
that 


(Ff)(2) = U-yf)(2) = Texp(ty) (2) f(exp(ty) - 2) 


dt 1t=0 


- (—2nzI + 2z0°(h) = 0° (y)) f(z) = micas 
(5.15) 


_— ; ; : 1 ¢t 
Similar, simpler computations give, for g = exp(tx) = ( 0 ‘ 


(Ef)(2) = (Ue _ = —f'(2). (5.16) 
Finally, for g = exp(th) = ei 0 ap , we have 
0 


g-0f2 
Lain =0 (‘ ; ) = exp(-t)0°(h). 


Hence it is not hard to verify that 


(Hf)(z) == (Unf )(z) = (—nI + 0° (A)) f(z) — zf"(2). (5.17) 
Under our hypothesis, we have a reproducing kernel and U is unitary. From 
our computations above, we can determine how U decomposes into irre- 
ducibles. The infinitesimal representation of U acts on the vector valued 
polynomials; a good basis for this space is {e;z” : n > O}; e; is the jth 
natural basis vector in C4. We have H(e;z”) = —(n + j + n)(ej2), so the 
lowest K-types of the irreducible summands are spanned by the ¢;. This 
space is also the kernel of E. So, U is direct sum of discrete series represen- 
tations (U"*, in the notation of [7]), each one appearing as many times as 
—(n +) appears on the diagonal of o(h). 


5.4. The Multiplicity-Free Case 


In order to be able to use the computations of [6] without confusion, we 
introduce the parameter A = 7 + 

From the last remark of the preceding section, it is clear that if U is 
multiplicity-free then o(h) is an (m+ 1) x (m+ 1) matrix with eigenvalues 
Ach Sy A ~ljassg mh So Be Oleg = —(A— 2-17 )6%; (0-9) shows 
that 


mi. ; 
o(h)(o(y)e;) =— ( + 7 + 5+ i) o(y) €;, that is, o(y) €; = const 541. 
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So, e(y) is a lower triangular matrix (with non-zero entries, otherwise we 
have a reducible bundle). The homogeneous holomorphic vector bundle 
determines o(y) only up to a conjugacy by a matrix commuting with o(h), 
that is, a diagonal matrix. So, we can choose the realization of our bundle 
by applying an appropriate conjugation such that o(y) = Sm, the triangular 
matrix whose (j, 7 — 1) element is j for 1 <j <‘m. 

By standard representation theory of SL(2, R), the vectors (—F’)"e; are 
orthogonal and the irreducible subspaces HY) for U are span{(—F)"e; : 
n > 0} for 0 < j < m. There is also precise information about the norms. 

Using this, we can construct an orthonormal basis for our representation 
space. 

For any n > 0, we let u3,(z) = (—F)"e;. 

To proceed further, we need to find the vectors u/,(z) explicitly. This is 
facilitated by the following Lemma. 


Lemma 5.1. Let u be a vector with ue(z) = uez”—*, 0< << mandn>0. 
We then have 


(—Fu)e(z) = (2A —m+l4n)ugz™t*~* + bug_yz™*1 4, 0< 2 < m. 


Proof. We recall (5.15) that —(F’f)(z) = 2Azf(z)+Smf(z)—2zDmf(z)+ 
27 f'(z) for f € H(n), where D,, = —o°(h) is the diagonal operator with 
diagonal {—3,-—} +1,...,}} and S,, is the forward weighted shift with 
weights 1,2,...,m. Therefore we have 


(—Fu)e(z) = (2Aug + Cug_1 — (m — 2€)ug + (n — byug)2"t** 
completing the proof. 0 


Lemma 5.2. For0<7<m and0<l<™m, we have 


P toys 0<f<j-1 
U ZZ) = 
m (R)(G+Ue(QA— mt 2th)", jsb<m, k=l-j, 


where i, 12) is the scalar valued function at the position € of the C™*'- 
valued function ui,(z) := (—F)"e;. 


Proof. The proof is by induction on n. The vectors uJ, are in H(n) for 
0 <j <™m. For a fixed but arbitrary positive integer 7, 0 < 7 < m, we see 
that u?, ,(z) is 0 ifn < €—j. We have to verify that (—Fuj,)(z) = u},4,(2). 
From the previous Lemma, we have 


(—Ful),(z) = (QA —m+l+n+4+ jie + fing gee, 
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where (—Fwu/,)¢(z) is the scalar function at the position @ of the C™*? - 
valued function (—Fu/,)(z). To complete the proof, we note (using k = ¢—j) 
that 


(—Ful,)j+«(z) 
= ((2)G + 1)e(2A — mt 25 + k)n—n(2A — m+ 2j +k +n) + 
(2s)G + De — m+ 25 +k— Dane" 
= (fj +1),(2A -—m+ 27 +k)n—k 
((")(Q\—m+2j+k+n)+(,",)(QA—m+2j+k—1))z"t1-* 
= (f+ 1)e(2\ —m + 25 +k) nk 
(((h) + (21) (2A —m + 29 +4 -1) + (m+1)(GQ))20* 
= (j+1),(2\-—m+2j7+k)n—k 
("t)(QA\-—m+2j+k—-1)+("fP')\(n—k4+1))er* 
= (f+ 1)g(2A— m+ 25 + k)n—e(("E')(2QA — mt 2j +n))ertt* 
= (f+ De((P*) (QA — mt 25+ k)ngi—n)22t* 
es psn?) 
for a fixed but arbitrary 7,0 <7 <mandk, 0< k < m—j. This completes 
the proof. gO 
On H), we have the representation U*’ acting (0 < 7 < m), where 
Aj =A—F+j. Its lowest K-type is spanned by ¢; (= u,) and He; = r;€;. 
By [7, Prop 6.14] we have ||(—F)*e,||? = o7||(—F)*~ te, ||? with 
of = (2\7 +k —1)k 


for all k > 1. (Here we used that the constant q in [7, Eq. (6.33)] equals 
Aj(1 — A;) by [7, Theorem 6.2].) We write 


which can be written in a compact form 


o;, = ((2A3)n(1)n), (5.18) 
where (%)p, = («1+ 1)---(a@+n-—1). We stipulate that the binomial co- 
efficient (j') as well as (x)n—% are both zero if n < k. 


» gt 
The positivity of the normalizing constants (o°, cs _;) * (i 29) is equiva- 


lent to the existence of an inner product for which the set of vectors e?, -; 
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defined by the formula: 


eo} . =(04)-2u)_.(z), n>j, 0<j<m 


n-Jj 
forms an orthonormal set. Of course, the positivity condition is fulfilled if 
and only if 24 > m. 

In this way, for fixed j, each e7,_ ; has the same norm for all n > j. Hence 


the only possible choice for an orthonormal system is {y1;e7,_ gia gf for 
some positive real numbers jz; > 0 (0 < 7 < m). However, we may choose 
the norm of the first vector, that is, the vector el, 0<j <™, arbitrarily. 
Therefore, all the possible choices for an orthonormal set are 


e) AZ) = es AZ : 
Mj a | ) (2— m+ 25)n—; ca | ), (5 19) 


n>j,0<j7 <m,and pj, O< 7 < mare m-+1 arbitrary positive numbers. 

Let us fix a positive real number A and m € N satisfying 2A > m. 
Let H) denote the closed linear span of the vectors {uje,_; 20555 
m, n > j}. Then the Hilbert space HO) is the representation space for 
U defined in (5.2). Since the vectors uJ, L uk as long as 7 # k, it follows 
that the Hilbert space HO is the orthogonal direct sum OF=0 aH) 

fi] 

We proceed to compute the reproducing kernel by using the orthonormal 
system {ujen_; >a > j},0 <7 <m. We point out that for 0 < 2< m, 
the entry le a at the position @ of the vector e _,(z) is 0 for n < 2£. 

J 


Consequently, e,,_; is the zero vector unless n > j. The set of vectors 
{nye}; :0 <j <™m, n> j} is orthonormal in the Hilbert space HO). 


We note that 


e},_5(2) = (enn jz” No » 


n~J n-J 
(eg 2))y 
0, 0<e<j-1 
~ (2A+2j7-—m+k)n—7— (n—j—k+1)~ (j+1)k .n—k . = 
jf Pence [aie GM ork, F< o<m, R= HG. 


We have under the hypothesis that we have a reproducing kernel Hilbert 
space on which the representation U is unitary, explicitly determined an 
orthonormal basis for this space. Now we are able to answer the ques- 
tion of whether this space really exists. For this it is enough to show that 
Sen (z) en(w) converges pointwise, the sum then represents the repro- 
ducing kernel for this Hilbert space. We will sum the series explicitly, and 


Homogeneous Operators 97 


will verify that it gives exactly the kernels constructed in [6]. This will com- 
plete the program of this paper by proving that the examples of [6] give all 
the homogeneous operators in the Cowen-Douglas class whose associated 
representation is multiplicity free. 

To compute the kernel function, it is convenient to set, for any n > 0, 


gee” ne: 0 vT 0 


Gi, t.2) = sig BD o,f eee nn, 0 


Cease FE Ol sea fle 


= Dr(z)G(n)D(y), (5.21) 


where D,,(z), D(s) are the two diagonal matrices and G(n) = (629 Py .j=0 


with ef! ; = Oif € < j or ifn < &. The nonzero entries of ie lower 
triangular matrix G(n), using (5.20), are 


(".7)G + Ie(2A — mt 25 + k)n—j—k 


'@s 7 Fi = 
i+k,j (7) (2A — m + 29) n—jV (nj 


_ VFQA— mt + 23 +6) je 7H k++ Ie G+ De 
7 (2A —m+ 27) (Ding (1)k 


~ {Aga Boe (n=Jj-h+VeG+Ve (5 99 


(2\ —m + 25)k (1)n—j—k (1)k 


for0<k<m-—y. 

Now, we are ready to compute the reproducing kernel Kj for the 
Hilbert space HY) = span{e!_; :n > jt}, 0 < 3 < m. Recall that 
K(z,w) = 9 en(z)en(w)* for any orthonormal basis e,, n > 0. This 
ensures that K is a positive definite kernel. For our computations, we will 
use the particular orthonormal basis e?, ; as described in (5.19). Since there 
are j zeros at the top of each of these: basis vectors, it follows that (¢, p) 
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will be 0 if either 2 < 7 or p < j. We will compute (K;(z,w)), at (¢,p) for 
jg <£,p<™m. For £,p as above, we have 


Co 


S- e} _ o(z)e,_; p(w) 


n>max(E,p) 


= Ss Ge; (0) Ging (n)2" "ww"? 


n>max(£,p) 


(K;(z,w) ep 


We first simplify the co-efficient Gy,j(n)G»,;(n) of z”~*w"?. The values of 
Gj(n) are given in (5.22). Therefore, we have 


Ge,j(n)Gp,j(n) 
_ & +l=j)n—e(n- b+ leg (Ap +P = J )a—p (2 — E+ —) 
7 (2A; )e-j (1)n-e (2A; )p—j (1)n—p 


ye M+ Vem i (9 + Yp-5 
(je; py 
_ 2A; +P = j)n-p(n = + Ye-; (= + = j)p-e(n — p + ae 
(2A; Je- j(1)n—p (2A; 5 ioe j)p—e(n =e 1)p—e 
ye E+ Ve-i (9 + VY p-; 
(1)e-j A) p-3 
_ (2Aj)p—5(2Aj +P = F)n—p(r = € + We-j(n = D+ Vp—5 (GF + Ve—j G+ Vp-i 


(25 )p—j(2Aj e—j 1) n—p(m — p+ 1)p-j (1)e-j (1)p—j 
_ djnag(n — C+ Ne-gln— P+ Vp-s G+ Deng J+ Vp-i 
(2Aj)p—5(2Aj )e—j (Ln (1)e-5 (1) ps 
Theorem 5.2. Given an arbitrary set [io,.-.-, Um of positive numbers, and 


2\ >m, we have 


KOH) (z,w) = nd z,w) = BO) (z, w). 


As a result, the two Hilbert spaces HO) and AO) of [6] are equal. 


Proof. We now compare the co-efficients (1; (z,w))e» with that of a 
known Kernel. Let B* (z,w) = (1 — zw)~2’, where B(z, w) = (1 — zw)~? 
is the Bergman kernel on the unit disc. We let a and 0 denote differentiation 
with respect to z and w respectively. Put 


BOD (2,w) = (89841 — 20)-™ );c,pem- 
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We expand the entry at the position (¢,p) of BO’) (z,w) to see that 
(BO (z,w))ep 


Dele ' 
= >, (2A,;) e— $4 7+ —o+D, 4 —(l—-3) av —(P-J) 


v2>max(l—j,p—j) 


eS ae va ee 
— » a =(n —£+ 1)e_3(n — p + 1)p—42 a7 Oa 
n-j 


where we have set n = m+ j. Comparing these coefficients with that of 
Giej(n)Gp,j(n), we find that 


K,(z,w) = Dj;B)(z, w)D;, (5.23) 


n>max(£,p) 


where D; is a diagonal matrix with oa iS at the (¢,2) position 
with j < ¢< m. Hence K;(z,w) = B*)(z,w) which was defined in the 
equation ([6, Eq. (4.3)]). 

Clearly, we can add up the kernels KA to obtain the kernel K 4) for 
the Hilbert space HO) = Org HY ). Hence the proof of the theorem is 
complete. L 


Corollary 5.1. The irreducible homogeneous operators in the Cowen- 
Douglas class whose associated representation 1s multiplicity free are exactly 
the adjoints of MO) constructed in [6]. 


Proof. In our discussion up to here we proved that the Hilbert space 
HO) corresponding to a homogeneous operator in the Cowen-Douglas 
class has a reproducing kernel given by KO) = SOO’ y2K;, 2A > 
1, [1,---,fm > 0. It follows from the Theorem that the kernels obtained 
this way are the same as (are equivalent to) the kernels constructed in [6]. 
These operators were shown to be irreducible ([6)). O 


We now consider the action of the multiplication operator MO) on 
the Hilbert space HO). Let H(n) be the linear span of the vectors 


a ,e)_,(2), ...,e7_,,(z)}, 
where as before, for 0 < £ < m, e! _,(z) is zero if n — & < 0. Clearly, 
HAH) = 62 4H(n). We have 
2G (un, 2) = Da(2)G(n)D(qs) 
= aaa (n)D(u) 
Dn41(z)G(n + 1)D(p)(D(p)-*G(n +: 1)-*G(n)D(w)). 
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If we let W(n) = D(p)~1G(n+1)~1G(n)D(p), then we see that ze)_,(z) = 
G(p,n+1,z)W;(n), where W;(n) is the jth column of the matrix W(n). It 
follows that the operator MO) defines a block shift W on the represen- 
tation space HO), The block shift W is defined by the requirement that 
W : H(n) > H(n +1) and Win) = Wy. 

Here, we have a construction of the representation space HO“) along 
with the matrix representation of the operator M) which is independent 
of the corresponding results from [6]. 


5.5. Examples 


Recall that G(p,n, z) = Dn(z)G(n)D(p). Once we determine the matrix 
G(n) explicitly, we can calculate both the block weighted shift and the 
kernel function. 

We discuss these calculations in the particular case of m = 1. First, it 
is easily seen that 


(Boma) 0 
G(n) = 7 , (5.24) 


( n yi" (prdans 2" Ce 
2-1 (1)n-1 (1)n-1 
The block W,, of the weighted shift W is 


1/2 
(atts) 0 
<= . (5.25) 


ee ected” Gas” 
ba \ 24-1 (2A+n—1)(2A+n) 2A+n 


Finally, the reproducing kernel KO) with m = 1 is easily calculated: 


1 z 
Cant (wz) 0 0 
ete i 1 1+(2\-1)w + yy i 1 
(aa2) 2\—1 (1—wz)2>F1 (—wz)241 


(5.26) 
One might continue the explicit calculations, as above, in the particular 
case of m = 2 as well. We begin with the matrix 


(2A—2)n 0) 0 


Gin)= | V (ots) (S52) jo 0 
mane VERS) (a= 


(5.27) 
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The block W,, of the weighted shift W, in this case, is 
\ BD 0 0 
(3-3) (expanses) \/ Dxen=T 0 


2 Gao: 2 /42)\(eestem) (es 


(5.28) 
Finally, the reproducing kernel KO“) with m = 2 has the form: 
KOM (z, w) 
a a a 
(1—wz)2A-2 (1—wz)?4-1 (1—wz)? 
-_ v7 1+(2rA—2) wz z(2+(2A—2)wz) 
— | G—w2z)>-1 (2\—2)(1—wz)2 (2\—2)(1—wz)2>41 
w? w(2+(2\—2) wz) 2+4(2\—1)wz+(2A—1)(2A—2) 2? w? 
(1—wz)2 (2\—2)(1—wz)2>F1 (2\—1)(2A—2)(1—wz)2>F? 
0 0 0 
+42] 0 a oe ) ——_ 
My (1—wz) (1—wz)2>F1 
0 2 Gay 25 erase 
0 0 
+p2/0 0 0 (5.29) 
0 0 aaa 
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In his talk at the International Colloquium on Algebraic Geometry held in 
January 1968 at the Tata Institute of Fundamental Research, Grothendieck 
formulated two “standard conjectures” on algebraic cycles, which arose 
“from an attempt at understanding the conjectures of Weil on the ¢ func- 
tions of algebraic varieties” ([4]). At the end of his talk he says “alongside 
the problem of resolution of singularities, the proof of the standard con- 
jectures seems to be the most urgent task in algebraic geometry”. The 
conjectures also form the basis of his theory of “motives”. (See [5], also [1].) 
Unfortunately, the conjectures are not yet proved. 
These conjectures are of two types: 


1) The standard conjecture of Lefschetz type. 
2) The standard conjecture of Hodge type (not to be confused with the 
Hodge conjecture on algebraic cycles, which will be mentioned later). 


We will state the conjectures first in the case of complex non-singular 
projective varieties and later in the case of abstract algebraic varieties. (The 
standard conjecture of Hodge type is known to be true in the complex case, 
from the work of Hodge. The conjecture of Lefschetz type is still unknown 
even in the complex case). 


6.1. The Case of Complex Projective Varieties 


Let X be a compact connected complex manifold of (complex) dimension 
n, embedded in a complex projective space P™(C). An algebraic cycle (of 
codimension 7) in X is a finite formal linear combination Z = }) a; Z; where 
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Z; are irreducible analytic (algebraic) subvarieties of X of codimension 7 
and a; are rational numbers (or integers, depending on the context). Such 
a cycle defines an element of the singular cohomology group H?’(X, Q), by 
associating to an irreducible algebraic variety Z its Poincaré dual [|Z], and 
extending this to cycles in the obvious way. 

To define [Z] one associates to Z its fundamental homology class in 
Ag(n—;)(Z, Z) either by using a triangulation or as in Theorem 3.2 and 
Proposition in 3.3 of [2]; the inclusion of Z in X yields an element of 
FAo(,—;)(X,Z) and [Z] is defined to be the image of this element under the 
Poincaré duality isomorphism H5(,_;)(X,Z) > H?'(X,Z). 

(Another way to define [Z] is to use the theorem of Hironaka on resolu- 
tion of singularities. First, for Z nonsingular, define [Z] to be i,(1) where 
i, is the Gysin homomorphism defined by the inclusion 7: Z > X. If Z is 
singular, let f : Z > Z be a resolution of singularities, with Z nonsigular 
and projective and f an isomorphism on the smooth locus of Z. Define [Z] 
to be (io f).[1]; the class thus obtained is independent of the resolution.) 

The real cohomology class defined by [Z] is the class of the (closed) 
current obtained by appropriately integrating forms of degree 2(n — 7) on 
Z ({2], Proposition in §3.4). 

We denote by A’ the Q-subspace of elements of H?’(X,Q) represented 
by algebraic cycles, and we call A’ the space of algebraic cohomology classes. 

Let us denote by H? the space H?(X, Q). Let € € H? be the cohomology 
class defined by a hyperplane section, and L : H? — H?*t? the operator 
L(u) = € Uu for u € H”, where U denotes the cup product in cohomology. 
We then have the Hard Lefschetz theorem ({10], Corollaire on p. 75): for 
O<p<n, 


L”?: H? = HP 


is an isomorphism. We now state 


The standard conjecture of Lefschetz type. For 0 < 2i<n, 
L"~ . H”* a5 H2"-21 


maps A* onto A”~*. (Note that L"~** maps A’ into A”~*.) 


Remark: The Hodge conjecture is the following: let u € H?’(X,Q); if u 
considered as an element of H?"(X,C) is represented under the de Rham 
isomorphism by a differential form of type (1,7), then u is an algebraic class. 
Let H** c H?*(X,C) denote the space represented by forms of type (i, i). 
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By the Hard Lefschetz Theorem, L"~?' maps H?*(X,Q) M H** isomorphi- 
cally onto H?"~?*(Q)N H"-*"-*, So we see that the Hodge conjecture on 
algebraic cycles implies standard conjecture of Lefschetz type. 

An equivalent form of the standard conjecture of Lefschetz type (for all 
smooth projective varieties) is: the operator A of Hodge theory is algebraic 
([8] Th. 4-1, p. 14). 

For the definition of A see §4 of [8] or p. 76, [9]. Using the Kiinneth 
formula and Poincaré duality, the operator A : H*(X,Q) — H*(X,Q) 
can be considered as an element of H*(X x X,Q), and to say that A is 
algebraic means that this element is an algebraic cohomology class in X x X. 
Grothendieck says that the conjecture seems to be most amenable in this 
form. 

Let X be as above. The kernel of the map 


Lraptl - HP - H2"-Pt+2 


is called the space of primitive forms and is denoted by P? (0 < p< 7). 


Hodge Index theorem. Let X be a complex projective manifold. Then 
on P?' A’ (the space of primitive algebraic classes) the Q-valued symmetric 
form 


(2, y) = (1)! I nUyUer 


is positive definite ({10], p. 77). 
The “abstract” analogue of the above result is the Standard Conjecture 
of Hodge type, described in the next section. 


Homological and numerical equivalence of algebraic cycles. ‘Two 
cycles Z,; and Z2 of codimension 7 in X are said to be numerically equiv- 
alent if the intersection numbers (Z;- Z) and (Z2- Z) are the same for all 
irreducible subvarieties of codimension (n — 7). They are said to be homo- 
logically equivalent if they have the same image in H?’(X,Q). It has been 
a long standing conjecture that numerical and homological equivalence of 
algebraic cycles (with Q-coefficients) coincide. This result can be seen to be 
a consequence of the standard conjecture of Lefschetz type as follows. Since 
the cycle map, which associates to a cycle its cohomology class, takes inter- 
section product into cup product, it is enough to show that the canonical 
pairing 


At ®g An? = Q 
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is non-degenerate. To prove this we define an operator 
*:H? — H*? 


which is an isomorphism and which maps algebraic classes to algebraic 
classes such that (a,b) > { aU («b) is positive definite on A’. If a € H’ and 


a \/ La;, aj € Ppt-23 
is the primitive decomposition (cf. [5], Th. 5 on p. 75) then xa is defined by 


That * maps algebraic classes to algebraic classes follows from the standard 


conjecture of Lefschetz type and positive definiteness follows from Hodge 
index theorem. 


6.2. Standard Conjectures in Abstract Algebraic Geometry 


Let X be a smooth irreducible projective algebraic variety of dimension n 
over an algebraically closed field k, perhaps of positive characteristic p. Let 
¢ be a prime number ¢ ¥ p. We denote by H* the ¢-adic étale cohomology 
group H*(X,Q,) with coefficients in the field Q¢ of ¢-adic numbers, which 
has been defined by Grothendieck ([6]). Let jm denote the group of roots of 
unity of order /™ in k, and set pic = U,,59 Him. For simplicity, we choose 
an isomorphism between jij and Qy/Z,~. (Making such a choice is called 
“a heresy” by Grothendieck!) We then have a cycle map 


Z'(X) @z Q — H” 


where Z*(X) denotes the group of i-codimensional (integral) algebraic cy- 
cles ((6], pp. 23-24). We denote by A’ the image and an element of A’ will 
be referred to as an algebraic cohomology class. Let € € H?(X) be the class 
of a hyperplane section. From the work of Deligne it follows that for m <n 


ue . H™ = H2n-™ 


is an isomorphism (Hard Lefschetz, [3], Theorem 5.5). We can now state 


The standard conjecture of Lefschetz type. For 2i <n, 
yer . HH? = Hen 22 


maps A’ isomorphically onto A". 
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For m < n we denote by P™ the primitive part of H™, defined as the 
kernel of the map 


ener ‘ H™ = H2r-™+2 


The standard conjecture of Hodge type. On P*' A’ the Q-valued 
symmetric form 


(a,b) = (-1)'K(aUbuUEe™ *) 


is positive definite, where K denotes the isomorphism H?” — Qy,. 


Remark: If both these conjectures are true then numerical and homolog- 
ical equivalence of algebraic cycles would coincide, as indicted above in the 
case of complex projective varieties. 


Remark: A priori it is not clear that A’ are finite dimensional vector 
spaces over Q. This would be the case if numerical and homological equiva- 
lence coincide, since the space of algebraic cycles modulo numerical equiv- 
alence is a finite dimensional vector space ({1], 3.5 and [8], Lemma 5.2). 


Remark: As in the complex case, the standard conjecture of Lefschetz 
type (for all varieties X) is equivalent to: the operator A is algebraic. 


Weil conjectures. As mentioned earlier, a motivation for standard con- 
jectures was to understand Weil conjectures (on rational points of varieties 
defined over a finite field). 

Let X be an irreducible smooth projective variety of dimension n, de- 
fined by polynomials with coefficients in a finite field F,. For each m > 1, 
let Vy, denote the number of points with coordinates in the extension field 
Fam. Define the function Z(t) by: 


log Z(t) = Som t™/m. 
m>1 
The function Z(t) is called the zeta function of X (The function Z(q~*), 
denoted by ¢(s), is also sometimes considered.) 
Among the conjectures of A. Weil on the zeta function are the following. 


1) Z(t) is a rational function. 
2) Z(t) is of the form 
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Pi(o) *** Poy 16) 


a P(t) «+ Pon(€) 


where P;(t) a polynomial with integer coefficients with Po(t) = 1—t and 
Py, (t) = (1 — q"t). The absolute value of the roots of P,(t) is q~*/?; i.e., 
P,(t) is of the form [],(1 — wijt) with | wi; |= q‘/?: (Riemann hypothesis). 


Remark: Moreover, if X is a non-singular algebraic variety defined over 
a number field K, then for almost all prime ideals p of K, the degree of 
the polynomial P; associated with Xy (the reduction of X mod p) coincides 
with the (usual) ith Betti number of the complex variety X. This gives 
a method of computing Betti numbers of complex projective varieties by 
computing rational points on varieties defined over finite fields. 

We will now indicate how the Riemann hypothesis (for varieties over a 
finite field), which has been proved by Deligne ({3]), is related to standard 
conjectures. Arguments due to Weil and Serre ([4]) yield (compare [9], 
Theorem 5.6 of [8], and [1], 5.4.3, p. 58): 


Proposition 6.1. Assume that the standard conjecture of Lefschetz type 
holds for X and the standard conjecture of Hodge type holds for the product 
X x X. (X smooth projective). Let f : X — X be a morphism such that 
f*(§) = g€ where € is the cohomology class of a hyperplane section and 
q€Qq>0. Then the eigenvalues of the endomorphism f\|,7: induced on 
the ith ¢-adic cohomology group H'(X, Qe) are algebraic integers of absolute 
value q'/?. 


Now let X be defined over a finite field Fy and let f : X — X be the 
Frobenius morphism 


(01-2) > (af 2%) 
raising the coordinates to q‘” powers. Note that the number of rational 
points in Fgm is the number of points left fixed by the m*” iterate f™, and 
the latter can be computed in terms of the trace of the action of f’ on 
H’ by using the Lefschetz fixed points theorem (extended to ¢-adic étale 
cohomology). One then finds that 


P;(t) = der al = tf)|H- 


On applying the previous proposition to f, we see that the absolute value 
ae is gl? (if we assume that the standard conjectures 


of an eigenvalue of f 
have affirmative answers). 
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Chapter 7 


On the Classification of Binary Shifts on the Hyperfinite 
IT, Factor 


Geoffrey L. Price 
Department of Mathematics, 
9E, United States Naval Academy, 
Annapolis, MD 21402, USA* 
glp @usna.edu 


We provide a complete classification up to conjugacy of the binary shifts 
of finite commutant index on the hyperfinite JJ, factor. There is a nat- 
ural correspondence between the conjugacy classes of these shifts and 
polynomials over GF(2) satisfying a certain duality condition. 


7.1. Introduction 


Let R denote the hyperfinite /J, factor. A pair of *-automorphisms o, p on 
R are said to be conjugate if there exists a *-automorphism y on R which 
satisfies yoa(A) = p07(A) for all A in R. The notion of conjugacy carries 
over to the setting of unital *-endomorphisms on R. In this situation it turns 
out that the Jones index [R : o(R)]| of the subfactor a(R) in R is a numerical 
conjugacy invariant, as is the commutant (or relative commutant) index: 
this is the first positive integer k (or co) for which the relative commutant 
algebra o*(R)' 29 R is nontrivial. 

In [12] R. T. Powers introduced a family of unital *-endomorphisms 
on R known as binary shifts. The range o(R) of each binary shift o is a 
subfactor of index 2. As a result the minimal possible commutant index for 
a binary shift is 2 (cf. [7]). Powers has shown [12] that there exist binary 
shifts of any specified commutant index k € {o00, 2,3,...}. In particular he 
showed that for any finite commutant index there are at most countably 
many conjugacy classes, but that there are uncountably many conjugacy 
classes having infinite commutant index (see Theorem 7.4). 
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In [15] we gave a complete classification of the conjugacy classes of 
binary shifts of commutant index 2. We showed that there is a natural cor- 
respondence between the conjugacy classes of these shifts and polynomials 
with coefficients in GF(2) which satisfy a certain duality condition (see 
Theorem 7.6). These are the polynomials which have constant coefficient 
1 and which have no self-reciprocal factors (see Definition 7.2) of degree 
greater than 1. In this paper we extend the results of [15] by establishing 
a correspondence between binary shifts of finite commutant index k > 2 
and polynomials over G'F'(2) with constant coefficient 1 which have no self- 
reciprocal factors of degree exceeding 2k — 3. Unlike the case for k = 2, 
this correspondence for higher commutant indices is not one-to-one, but we 
produce a recursion formula (4.1) which relates the number of conjugacy 
classes of binary shifts of commutant index k which are associated with 
each of the polynomials of the form described above. As a consequence we 
can provide precise information, for example, on the number of binary shifts 
of fixed finite commutant index which are associated with any irreducible 
polynomial p(x) over GF(2). 

There is another notion of equivalence among unital +-endomorphisms 
on R, in addition to conjugacy, which is known as cocycle conjugacy. A 
pair a and ( of unital x-endomorphisms are said to be cocycle conjugate 
if there exists a unitary u € R for which ao Ad(u) and @ are conjugate. 
In [20] it was shown that all binary shifts of commutant index 2 are cocycle 
conjugate and partial results were obtained in [21] for binary shifts of higher 
finite commutant index. 

Since their introduction binary shifts have been used by several authors 
to illustrate various phenomena. In [10] the authors constructed a binary 
shift o on the hyperfinite JJ, factor R such that o has a unique invariant 
state ¢@ and for which the Connes-Stgrmer entropy (cf. [4]) hg(o) is 0. 
On the other hand hgag(o ® a) = log(2), which shows that the additive 
tensor product formula for the Connes-Stgrmer entropy fails in general (see 


also [6] and [9]). In [18] it was shown that if o is a binary shift with finite 
1 


5log(2) where 


commutant index then its Connes-Stermer entropy h;(c) is 
T is the unique tracial state on R. 

It is a pleasure to thank Alexis Alevras and Robert T. Powers for helpful 
conversations. We are also grateful to Professor Powers for writing a very 
enlightening computer program related to the classification of binary shifts. 

This paper appeared in an abbreviated form in the Proceedings of the 


National Academy of Sciences, see [19]. 


Classification of Binary Shifts 113 


7.2. Preliminaries 


In this section we present Powers’ construction of the binary shifts on R. We 
also state some of the results which are known about binary shifts and which 
are relevant to the classification of their conjugacy classes. In particular we 
shall exploit the close connections that have been made between the theory 
of recurring linear sequences and binary shifts. See [8], Chapter 6, for an 
extensive bibliography on the subject of recurring linear sequences. 

A spin system is a sequence of self-adjoint unitary operators {u; : i € 
Z*} that commute pairwise up to a phase in the sense that 


UjU;j = Aggthg th 


for complex numbers \;;, where i,j € Z*. Since the u; are self-adjoint, 
u? = I and so it follows that Aij = Aj; = +1 for all i and 7. 

Now given a subset X (finite or infinite) subset of N, and its charac- 
teristic function g : N — {0,1}, we can use g to define a spin system as 
follows. Let {uo,ui,...} be a sequence of hermitian unitary operators, or 
generators, which satisfy the commutation relations 


UjUi4j = (—1)9P uss jus, 4, 9 E ZG. (lel) 


Note that the commutation relations in (1.1) are translation invariant, 
i.e., the equations above are independent of the choice of subscript 7. 

One may define words in the generators by setting, for finite ordered 
subsets J = {Jo,J1,---;Jm} of distinct nonnegative integers, u(J) = 
Ujy Uj, °**Uys,,, and u(@) = I, the identity. In fact, since the u,’s are hermi- 
tian and satisfy (1.1), any product of the generators may be rewritten as 
either +u(J) or —u(J) for some finite ordered subset J C Z*. For n € N, 
let 2, be the finite-dimensional group algebra over C consisting of lin- 
ear combinations of the words in the generators {uo, Ui,...,Un—1}. Note 
that 2, has dimension 2”, the number of words in the generators. Since 
A, C Anii for all n € N, one may obtain an AF'—algebra by taking the 
uniform closure of the union U°°,2,,. Following the terminology of [12], 
Definition 3.2, we refer to 2 as the binary shift algebra associated with X. 
The binary shift itself is the unital *-homomorphism on 2 defined uniquely 
by the mappings o(u;) = uj41 on the generators. We shall refer to X 
as the anticommutation set, to a = {ao,a1,...}, where a; = g(j), as the 
bitstream, and to a = {...,@2,@1,@0,@1,@2,...} as the reflected bitstream 
associated with oa. 


114 G. L. Price 


If the reflected bitstream is periodic it is easy to see that 2( has a non- 
trivial center. In fact, if 4 has period length p the word uguy (as well as its 
shifts) lies in the center. On the other hand, 2 has trivial center if a is not 
periodic. 


Theorem 7.1. (cf. /12/, Theorem 3.9, [16], Theorem 2.3, [18], Corollary 
5.5) Let & be the binary shift algebra with anticommutation set X and corre- 
sponding bitstream given by a; = g(j),j € Z*. Then the folloing conditions 
are equivalent. 


(i) a ts not periodic. 
(ii) The center of % consists of scalar multiples of the identity. 
(iii) For any nontrivial word w, wu; = —ujw for some u;. 
(iv) 2 has a unique normalized trace. 
(v) 2 is isomorphic to the UHF algebra of type 2°. 


Hence if any of the above conditions hold then with respect to the unique 
trace on 2, the weak operator closure of 2 is isomorphic to the hyperfinite 
II, algebra R. 


(For the structure of the algebras generated by more general spin sys- 
tems see [1].) 

Assuming any of the conditions of the theorem hold the mapping o : 
Uj — Us41 for all i € Z* uniquely defines a binary shift on R. 


Theorem 7.2. (cf. [12/, Theorem 3.6) A pair of binary shifts on R are 
conjugate if and only if they are defined via the same bitstream. 


Observe that the trace 7 satisfies T(u(J)) = 0 for any nontrivial word 
w = u(J): for by condition (iii) there is for any word w a generator u,; such 
that 7(w) = 7(uj;?w) = T(ujwu;) = —T(w). 

It is straightforward to see that the binary shift o on 2 extends to a 
unital *-homomorphism on R which we shall also denote by a. Then we 
have the following result for binary shifts on R. 


Theorem 7.3. (cf. /12/, [7], Example 2.3.2) For any binary shift on R the 
subfactor o(R) has Jones index |[R: o(R)] = 2. 


As aconsequence of the previous theorem the minimal commutant index 
for a binary shift is 2, [7], Corollary 2.2.4. On the other hand, if X = {k—1}, 
then uo € o*(R)'N R,o*1(R)'N R = CLI, so the associated binary shift 
has commutant index k. Hence there exist examples of binary shifts for any 
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finite commutant index k > 2. The following theorem characterizes those 
binary shifts with finite commutant index. 


Theorem 7.4. (cf. [2/, Theorem 2.1, [17], Theorem 5.8) A binary shift on 
R has finite commutant index if and only if its bitstream a is eventually 
periodic. For a binary shift o of finite commutant index k and generators 
uj,j © Zt, there is a word w = u(jo,j1,---,Jjn) such that 


(i) jo =0, «.e., w “starts” with uo, and 
(ii) for any s > 0, o**8(R)' OR is a 2°—dimensional algebra generated by 
the words w,a(w),...,0"(w). 


Definition 7.1. Let o be a binary shift on R with generators u;,j € Z™. 
Let z be a word in the generators, and let p(x) = co + c1@ + --- + Cp” 
be a polynomial with coefficients in GF(2), then < z,p > is the word 
gg gy seme (2)? Toy Fe 


Suppose p(x) is the polynomial for which w =< uo,p > in the preced- 
ing theorem. In the study of binary shifts of commutant index 2 in [15] 
we described connections among polynomials p(x) words w =< uo,p > 
generating relative commutant algebras, and conjugacy classes of shifts. 
Theorem 7.6 describes this connection. In order to state the theorem we 
need to identify polynomials which possess a special symmetry. 


Definition 7.2. A polynomial p(x) with constant term 1 is called reciprocal 
or self-reciprocal if its coefficients are flip-symmetric (see [3], [8]), i-e., p(x) = 
Cot Ce + + tent” = Cy t+ Cn-14 +--+ + C02”. 


Remark 7.1. Note that if a polynomial p(x) = cp + cpu +--+ +¢n2" with 
constant coefficient 1 then p*(x) = x"p(4) = cn +en-14+-+++c92". Hence 
p(x) is reciprocal if and only if p(a#) = p*(z). 


The following results about reciprocal polynomials will be used in the 
next section. 


Lemma 7.1. Jf h(x) is a polynomial with constant coefficient 1 then 
h*(ax)h(a) is reciprocal. The product of reciprocal polynomials is reciprocal. 


Proof. Obvious. LJ 


Theorem 7.5. (see /15/, Theorem 4.8) Any polynomial p(x) has a unique 
reciprocal divisor of maximal degree. 
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Theorem 7.6. Let p(x) be a polynomial over GF(2) with constant coef- 
ficient 1. Then there exists a binary shift of commutant 2 and generators 
uj,j € Z* such that < uo,p > generates o*(R)' OR if and only if p(x) has 
no reciprocal factors of degree exceeding 1. Moreover there is a one-to-one 
correspondence between such polynomials and the family of binary shifts of 
commutant index 2. 


It is possible to show that among all polynomials of degree n > 3 and 
constant coefficient 1 there are 2"~? which satisfy the hypotheses of the 
theorem, (see [15], Theorem 4.4). Hence there are countably many conju- 
gacy classes of binary shifts of commutant index 2. Corollary 7.7 establishes 
the same conclusion for any finite commutant index. 

Although we are interested in analyzing binary shifts on R it is useful for 
that purpose to understand the structure of the AF—algebras [U°@, 2,]~ 
which have nontrivial centers (see the proof of Theorem 7.10). 


Theorem 7.7. Let a be a bitstream for which a is periodic. Let 2% be the 
corresponding AF'— algebra, and let rT be the trace on 2 which vanishes on 
nontrivial words. Let M be the von Neumann algebra obtained by completing 
2 in the weak operator closure with respect to T. Then there exists a word 
w =< U0, p >= Upruy' --- uc such that 


(i) Co = 1, 
(ii) p(x) is reciprocal, and 
(iii) the center of M is generated by w and its shifts. 


Hence the center of M is isomorphic to the algebra of continuous functions 
on the Cantor set. 


Suppose a is a bitstream in GF (2) with a9 = 0. Then for each n € N 
one may construct an n x n Toeplitz matrix A, whose first row consists of 
the first n elements of a, viz., 


do a) a2 Ag... An—1 
a, a a, a2 An—2 
An=|  @ 4 41... @n—3 (1.2) 
An—1 An—2 An-3.---.-.-- ao 


Since A,, is a skew-symmetric matrix it has even rank (this holds true 
even for matrices over GF(2), (see [11], Theorem IV.11). Considered as 
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a sequence of matrices, the Toeplitz matrices associated with a bitstream 
exhibit a remarkable property. 


Theorem 7.8. (cf. [13], Theorem 5.4) Let v(A,) denote the nullity of the 
nxn Toeplitz matriz above. If a is not periodic the sequence {v(A,) :n € N} 
consists of the concatenation of strings of non-negative integers of the form 
1,2,...,m—1,m,m—1,...,2,1,0. [fa is periodic then the nullity sequence 
consists of finitely many strings of the above mentioned form followed by 
the sequence 1,2,.... 


Theorem 7.9. (cf. [5/, Corollary 2.10, see also [22]) For any even positive 


integer n there are 2-2 nxn invertible Toeplitz matrices of the form above. 


Finally it will be helpful to use the following properties of the operations 
< z,p > for a word z in the generators of a binary shift on R and for 
polynomials p(x) with coefficients in GF'(2) (see [16], Section 4): 


22> 29> Hot = wp gs (2.8.,) 
<< WS, ¢>S2 < WwW o7 > (1.3.2) 


7.3. Bitstreams and Polynomials 


In this section we prove some elementary results about bitstreams over 
finite fields which are based on some well-known results from the theory 
of linearly recurring sequences, (see [8], Chapter 6). Our results stem from 
important connections which exist between eventually periodic bitstreams 
with entries in a finite field and polynomials with coefficients in the same 
field. Here we deal exclusively with the finite field GF'(2). We shall say that 
a polynomial p(x) = co + c1.u +-+++ nx” annihilates a bitstream a if for 
any j € Zt, 3), ciaj41 = 0. If a bitstream is eventually periodic, i.e., if 
there exists a positive integer s such that a, = ax+4, for all k > N, some 


N+s 


N, then the polynomial 2% +2 annihilates a. In particular one has the 


following result. 


Lemma 7.2. (cf. [8], Theorem 6.11) A bitstream a over GF (2) is eventu- 
ally periodic if and only if it is annihilated by some polynomial p(x) with 
coefficients in GF(2). a is periodic if and only if it is annihilated by a 
polynomial with constant coefficient 1. 
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If a and b are bitstreams then define addition by a+b = ec where 
c; = a; +b; for all 7 € Z*. Defining scalar multiplication in the obvious 
way, one sees that the set of periodic (respectively, eventually periodic bit- 
streams) forms a vector space over GF'(2): for if a is periodic (respectively, 
eventually periodic) with period s and b is periodic (respectively, eventu- 
ally periodic) with period t, then c is periodic (resp., eventually periodic) 
with period a divisor of st, (see [8]). The same is true for doubly-infinite 
periodic sequences, and also for the subspace which consists of the reflected 
bitstreams 4 = {...,@2, 41,0, @1, @2,... },a@9 = 0, which happen to be pe- 
riodic. Thus we have: 


Lemma 7.3. The set of periodic reflected bitstreams forms a vector space 
over GF (2). 


Definition 7.3. (cf. [8], Section 6.5) For a polynomial p(x) with coefficients 
in GF(2) let S(p) be the vector space of periodic reflected bitstreams an- 
nihilated by p(x). 


Proposition 7.1. Suppose a is a periodic reflected bitstream in S(p). If 
r(x) is the maximal reciprocal factor of p(x) then a € S(r). 


Proof. Since the reflected bitstream a is symmetric about its entry ag = 
0, it is clear that p(x) annihilates a if and only if p*(x) does, too (see 
Remark 7.1). Hence a € S(p) S(p*). But for any pair p,q of polynomials, 
S(p)N S(q) = S(ged(p,q)), by [8], Theorem 6.54. Clearly r divides the 
polynomial d = gcd(p, p*). Suppose p(x)/r(a) = [][ hi (x), where the h;'s 
are irreducible, then clearly d(a) = r(a)-ged ([] hi(x), [] hi*(x)). But if h(x) 
is an irreducible factor of both []h;(x) and [|] h;*(x) then h(a) = h;(a) = 
h;*(x) for some i and j. But then by Lemma 7.1, hjh; = hi(x)h,* (x) 
is reciprocal, which contradicts the maximality of the degree of r. Hence 
r = gcd (p, p*) = S(p) 1 S(p*) annihilates a. O 


Proposition 7.2. Let r(x) be a reciprocal polynomial with constant coeffi- 
cient 1 and degree either 21 or 21+ 1,1 € Z*, The vector space of periodic 
reflected bitstreams annihilated by r(x) has dimension lI. 


Proof. Let é = {...,c2,C1,C0,C1,C2,...} be a periodic reflected bit- 
stream. Since r is reciprocal of degree 2/ or 2/+ 1, and € is symmetric about 
co, r annihilates ¢ if and only if it annihilates {c,cj-1,...,¢1,¢0,C1,--- }- 
Since degree(r)€ {21,21 + 1}, it is clear that c1,c2,...,cy may be chosen 
arbitrarily but that cj+1,ci+2,... depend on the choice of c; through c.0 
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Corollary 7.1. If r(x) is a reciprocal polynomial with constant coefficient 
1 and degree either 21 or 21+1,1 € Zt, there are exactly 2' periodic reflected 
bitstreams annihilated by r(x). 


Definition 7.4. Let k > 2 be a fixed positive integer. Let w = 
Up uy: Ucn, with co = 1, be a word in the generators u;,j € Zt of 


a binary shift o. Then w is called a qkword if w € o*(R)'A R but 
w éo® (ROR. 


Remark 7.2. Suppose a is a bitstream and w is a word of the form 
above. Using (1.1) repeatedly it follows that for j € Zt, wu; = 
(—1)%coFaj—-1e1 tt F4in—j\ny-w, Hence w is a qkword if and only if there 
exists a bitstream a satisfying the following linear system. 


Ap—1C0 + Ap—2C1 + Ap—3C2Q + +++ + Ain—K-1/Cn = 1 
AnCo + Ap—1C1 + Ap—2€2 +++ + Ajn—KICn = O (2.1) 


Ak+1CO + ApCy + Ap—-102 + +++ + Gn—K41/Cn = O 


The first equation holds since w must anticommute with uz; and the 
remaining equations hold since w commutes with uz, Up+1,..-- 


Remark 7.3. As [R: o(R)] = 2,0(R)’N R=CI ([7], Corollary 2.2.4), so 
there are no qlwords. 


We shall see below that for fixed values co,c1,...,Cn it is possible to 
have more than one bitstream a which satisfies (2.1). For that reason we 
shall need the following terminology. 


Definition 7.5. Let p(x) = co +c@ +-+:+ nx” be a polynomial with 
constant coefficient 1. If the system (2.1) is satisfied then p(x) is said to 
meet a at the integer k. If there is an integer k for which p(x) meets a then 
we say that (p,a) are a binary pair. 


Remark 7.4. Note that if (p,a) meet at k then if o is the binary shift on 
R with generators u;,j € Z* and bitstream a, w =< uo,p >€ o* (RJM 
R, w ¢ o®1(R)' A R. Hence w is a qkword and o is a binary shift of 
commutant index < k. On the other hand, since 2 is the minimal possible 
commutant index for a binary shift, if a binary pair (p,a) meet at 2 then 
o has commutant index 2. 
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Theorem 7.10. Let k > 2 and n> 2k —2 be fixed integers. Then for any 
polynomial p(x) of degree n and constant coefficient 1 there are at most 
2*-2 distinct bitstreams a = {ao,a1,...} which meet p(x) at k. 


Proof. Induction on k. Using the remark above, a restatement of Theo- 
rem 7.6 shows that any polynomial of any degree n with constant coefficient 
1 can meet at most one bitstream a at the integer 2. Suppose the assertion 
holds for 7 = 2,...,k —1. Suppose there is a polynomial p(x) of degree 
n > 2k — 2 with constant coefficient 1 which meets some bitstream a at k. 
Let o be the corresponding binary shift which, by the preceding remark, 
has commutant index < k. Let u;,j € Zt be the generators for o. 

Now suppose p is any other binary shift, with generators {v; : 7 € Zt} 
and bitstream b, such that < vp, p > is a qkword. Let s = a+b. Note that 
since p(a) meets a at k, < uo, p > is a qkword for o. Hence the coefficients 
Co,;---,€n Of p(x) satisfy the infinite linear system (2.1), as well as system 
obtained from (2.1) by replacing each a; with b;. Hence [co, ..., Cn] satisfies 
the linear system of equations. 


Sk—-1Co + Sk—2C1 + SK-3C2 + +++ + Sin—K-1]Cn = O 
SKCo + Sk—-1C1 + Sh—-2€2 + +++ + Sin—K|/Cn = O (2.2) 


Sk41C0 + $~C1 + $K-102 + +++ + Sin—K41/Cn = O 


It follows that if zo, 21,... is a family of hermitian generators satisfying 
the commutation relations given by s and if 7 is the shift on the von Neu- 
mann algebra M generated by the z,’s, then either z =< zo, p > commutes 
with all of the z;’s and lies in the center of M (in which case the re- 
flected bitstream a of a associated with 7 is periodic, Theorem 7.7), or M 
is the hyperfinite JJ, factor R, 7 is a binary shift on R, and there is a 
j € {2,3,...,k—1} such that z € 79(M)'NM,z ¢79-1(M)'NM, ice., that 
z is a qjword. 

Now let s@,s@),...,s8(™ be the list of all bitstreams s satisfying one 
of the following two conditions with respect to the fixed polynomial p(z): 
either (i) the reflected bitstream $ of s is periodic and annihilated by p, or 
(ii) p(w) meets s at j for some j € {2,...,4 —1}. Then by the preceding 
paragraph it follows that the bitstreams b which meet p(x) at the integer 
karea,ats®,...,a+s(™, 

Let r(x) be the maximal reciprocal factor of p(x). If r(x) has degree 
either 0 or 1 then by the preceding lemmas there are no periodic reflected 
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bitstreams a4 annihilated by r(a) (and hence by p(x), Proposition 7.1, 
Proposition 7.2) other than a= {...,0,0,0,...}. Hence the nontrivial bit- 
streams s in the previous paragraph meet the polynomial p(2) at some 
integer 7 for 2 < 7 < k—1. By the induction assumption there are no 
more than 2/~? bitstreams which meet p(x) at each such j. Then from the 
preceding paragraph, the number of bitstreams b which meet p(x) at k is 
no greater than 1 + ae: a ae 

Next suppose that r(a) satisfies 2 < deg(r(a)) < 2k—2. Write deg(r(z)) 
= 2l or 21+ 1. By Corollary 7.1 the number of periodic reflected bitstreams 
4 annihilated by r(z) is 2'. By Proposition 7.1 these are exactly the periodic 
reflected bitstreams which are annihilated by p. By Lemma 7.5 below there 
are no bitstreams which meet p() at j for 1 < 7 < 1+ 1. On the other 
hand, by the induction assumption there are, for 1 + 2 < 7 <n, no more 
than 2/~? bitstreams which meet p(x) at j. Then the number of binary 


shifts which meet p(a) at k cannot exceed 2! + es 7a. 


7.4. Counting Polynomials with Symmetry 


In this section we complete the analysis necessary to enumerate the binary 
shifts of finite commutant index on the hyperfinite IJ, factor R. As in [15], 
where a classification was made of the binary shifts of commutant index 2, 
we establish a natural connection between binary shifts of finite commu- 
tant index and polynomials over GF'(2) which satisfy a certain symmetry 
condition. In the course of making this connection, it is convenient first to 
study the family of binary pairs which meet at a fixed integer k > 2 (see 
Definition 7.5) and subsequently to match these pairs with binary shifts. In 
Lemmas 7.4 and 7.5 we show that a polynomial p(x) with constant coeffi- 
cient 1 meets a bitstream at k if and only if p(x) has no reciprocal factors of 
degree > 2k — 2. Using these results we are able to provide a recursion for- 
mula which counts, for each polynomial p() above, the number of binary 
shifts o of commutant index k, with generating family u;,j7 € Z* of her- 
mitian unitaries for which < uo,p > generates the first nontrivial relative 
commutant algebra o*(R)!O R. 

Recall that a polynomial p(7) = cp + c1@ +--+ + cy,2” with constant 
coefficient co = 1 meets a bitstream a at an integer k if and only if the coef- 
ficients of p(x) satisfy the following infinite system (3.1) of linear equations 
over F’. The first equation holds since w =< uo,p > must anticommute 
with uz_1, and the remaining equations are satisfied since w commutes 
with the generators ux, Up41,-... Ifn < 2k —2 then |Jn—k-—1| < k—1 and 
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we observe, since cy = 1, that for any choice of a1, a2,..., @x—2, there exists 

one and only one choice of az,ax%41,.-. such that the system holds. Hence 

w is a qkword which corresponds to exactly 2*~? distinct bitstreams a. 
For n = 2k — 2 the system above becomes 


Qe—1Co + Ap—2C1 + Qp—3Cq + +++ + Gp—1C2x~-2 = 1 
AnCo + Ap—1C1 + Ap—2Co + +--+ Gp_2Cop-2 = 0 (3.1) 


Qj4-1Co + Agcy FOR_1€7 +: + Gp—gCop—o = 0 


If the first equation holds then clearly there is one and only one choice 
for each of the remaining terms az, a~41,... to satisfy the system of equa- 
tions. Note that the first equation may be rewritten as agcx—1 + a1(cK + 
Ch—2) + Go(Ck+1 + Ck—3) + O3(Ck+2 + Cea) +°°+ + Gx—1(Con—2 +0) = 1, 
or (since a9 = O and cox-2 + co = 0), as ar(cK + Ce-2) + Go(CRy1 + 
Ck-3) + a3(Cke+2 + Cha) +--+ + Qx—2(Con—3 +€1) = 1. Hence if aj,..., ax—2 
satisfy this equation then a,—; may be chosen arbitrarily. If p(a) is re- 
ciprocal, however, the left side of this equation is necessarily 0 and so 
no solution is possible. On the other hand, if p(x) is not reciprocal then 
[Ch + Ch—2, Ch+1 + Ck—3,Ck+2 + Ck—4,---,Cor-3 + 1] 4 [0,0,...,0]. Hence 
there is at least one solution to the initial equation above. By viewing the 
initial equation as a nonhomogeneous linear system of 1 equation in k — 2 
unknowns aj, d@2,...,@—2, since there is at least one solution, there are 
therefore exactly 2’~? solutions [a1,a2,...,@,—2] to this first equation. As 
noted above, the entry a,—; may be chosen arbitrarily but a,,ax%41,... are 
determined by their predecessors, hence there are 2"~? bitstreams satisfy- 
ing the system for each polynomial which is not reciprocal. Note that there 
are 27-1  Qh-1 — 92k-38 _ ok-1 — 97-1 _. gn -F+1 nolynomials of desreé n 
with constant coefficient 1 which are not reciprocal. Hence we have shown 
the following. 


Lemma 7.4. If k > 2 and n < 2k — 2, any polynomial of degree n with 
constant coefficient 1 meets at k with 2*~? distinct bitstreams. If n = 2k —2 
then 2"-!—2”"-F+1 polynomials of degree n with constant coefficient 1 which 
meet at k with some bitstream. These are the polynomials which are not re- 
ciprocal. Each of these polynomials meets at k with 2*~? distinct bitstreams. 


The analysis pertaining to polynomials of degree exceeding 2k — 2 is 
considerably more difficult, and we devote the remainder of this section to 
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studying this case. As we shall see below, whether a polynomial of degree 
n > 2k—2 meets a bitstream at the integer k is determined by the degree of 
its maximal reciprocal factor. Polynomials with reciprocal factors of high 
degree will not correspond to qkwords, and therefore we will be led to 
counting the number of polynomial of fixed degree n > 2k — 2 having 
maximal reciprocal factors exceeding a certain degree (see Theorem 7.12). 


Lemma 7.5. Let k > 2. No polynomial p(x) with coefficients in GF (2), 
constant coefficient 1, and a reciprocal factor of degree > 2k —2 meets at k 
with any bitstream. 


Proof. Suppose p(x) = q(a)r(x) where r(x) is a reciprocal polynomial of 
degree m > 2k — 2, and suppose p(x) meets at k with some bitstream a. 
Then there is a binary shift o with generators u;,j € Z*, say, such that 
w =< uo,p > is a qkword for o. Let z =< uo,q >, then w =< uo, p >=< 
uo,gr >= + << uo,g >,r >= + < z,r >. It is straightforward to see 
that if z; = o/(z),j € Z*, then o restricts to a binary shift on the von 
Neumann algebra M generated by the z;’s and w € o*(M)'N M but w ¢ 
o*-1(M)'NM. Hence w anticommutes with z,_, so that if b is the bitstream 
for the restricted binary shift on M and r(x) =Ib +ha+---+lnz™, 


bp—ilo + be_-aly +--+ + Oana = 1. (3.2.1) 
If m = 2k — 2 then since r(x) is reciprocal (3.2.1) becomes 
be—1lo + Dp—2la + +++ + Oilp—2 + bole—1 + bilg—2 + be-1lo =0, (3.2.2) 


a contradiction. If m > 2k — 2 then since r(x) is reciprocal (3.2.1) may be 
rewritten 


be—1lm + Ok—alm—1 +°°° + Om—k4ilo = 1, (Buds8) 


which implies that w anticommutes with 2,—~41, also a contradiction (since 
m—k+1 > kandw € o*(M)'NM). By contradiction w cannot be a qkword. 
Equivalently, p(a#) does not meet at k with any bitstream. O 


Remark 7.5. We shall see below (Corollary 7.4) that for n > 2k — 2 all 
other polynomials with constant coefficient 1 meet at k with at least one 
bitstream. In fact, they meet at k with exactly 2*~? bitstreams. 
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Below we shall count the number of polynomials of fixed degree n > 
2k — 2 which have maximal reciprocal factors of degree > 2k — 2. In order 
to make this calculation we require both a definition and a result from [15]. 


Definition 7.6. (cf. [15], Definition 4.1.) A polynomial f(z) € F[z] with 
constant coefficient 1 is completely free if f(x) has no reciprocal factors 
except for the constant polynomial 1. z(n) denotes the number of completely 
free polynomials of degree n with constant coefficient 1. 


Theorem 7.11. (cf. [15], Theorem 4.4.) Let r > 1 be a fixed integer. If 
n = 2r, then z(n) = 3(2-47-1 +4) —2 and ifn = 2r +1 then z2(n) = 
3(4” — 4) +2. 


Theorem 7.12. With the same notation as above, there are, for n > 2k — 
2, exactly 2"-* polynomials of degree n with constant coefficient 1 whose 
maximal reciprocal factor has degree > 2k — 2. If n = 2k — 2 there are 
2r—(kK-1) — 2k-1 such polynomials. 


Proof. For n > 2k — 2 let r(n) denote the number of polynomials satis- 
fying the hypotheses of the theorem. For j € N let s(n) denote the number 
of reciprocal polynomials of degree n with constant coefficient 1. Obviously 
if nm = 21 or 21+ 1, s(n) = 2'. From [15], Theorem 4.3, each polynomial 
with constant coefficient 1 can be decomposed uniquely into a product of 
a reciprocal and a completely free polynomial. If n = 2k — 2 then clearly 
r(n) = s(n) = s(2k — 2) = 2*-!. If a polynomial p(x) of degree 2k — 1 or 
2k has a reciprocal factor g(x) of degree at least 2k — 2 then f(x)/g(x), a 
factor of degree 2 or 1, is itself reciprocal, so by Lemma 7.1 f(x) is recip- 
rocal. Hence r(2k — 1) = s(2k — 1) = s(2(k —1) +1) = 2*-1 = 2”-*, and 
FOk) = sO) = 22 =2"-*, 
Suppose n > 2k then since z(1) = z(2) = 0, 
n—3 
r(n)= D> s(J)2(n—J§) + 8(n)z(0). 


j=2k—2 


By direct calculation or by [15], proof of Theorem 4.4, 
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so we calculate, with d = 2k — 2 


dA 
s(j)z(n — Jj) = 8(2)z(n — 2) + 8(3)z(n — 3) + 


s(4)z(n — 3) +. 8(5)z(n — 4) + 


a(d — 2)z(n —d +2) + s(d—I)2(n—d+ 1). 
Suppose n is even. Then with n = 2m, by the preceding theorem, 
s(2t)z(n — 2t) + s(2t+ 1)z(n — 2t — 1) 
= 2’{2(2(m — t)) + z(2(m—t-1)+1)} 
= 2/50 amt} 4 4) 24 scam —4)+ 2} 


= ee hala _— g2m—t—2 


Se) 
d-1 2k-3 
s(j)a(n — j) = DS) s(j)z(2m — j) 
j=2 j=2 
k—-2 
_ g2m—t—2 
t=1 
— 92m-3.4 .,, 4 92m—k 
_ 92m—2 _ 92m—k 
_ 9n-2 _ on—k 
Therefore 
n—3 
r(n)= S > s(j)2(n—j) +.8(n)z(0) 
j=2k-2 
n—3 ai 
=} s(/)z(n— 9) + 8(n)2(0) — > a(f)2(n - 9) 
j=2 j=2 


= gr-2 _ {on- 2 9 ey gn— k 


We omit the similar argument for the case when n is odd. 
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Combining the preceding result with Lemma 7.5 gives the following. 


Corollary 7.2. With the same notation as above, let n > 2k —2 be a fixed 
integer. If n = 2k — 2 then the number of polynomials of degree n with 
constant coefficient 1 which meet at k is at most 2”—! —2*-! (equivalently, 
the maximum number of polynomials of coefficient 1 for which < uo, p > ts 
a qkword is 2"-1— 2-1). Ifm > 2k —2 there are at most 2°—-1—2"—* such 
polynomials. 


Definition 7.7. For a fixed nonnegative integer n and an integer k > 2 we 
denote by BP(n,k) the set of pairs (p,a) such that (i) p(x) is a polynomial 
of degree n with constant coefficient 1, (ii) a is a bitstream, and (iii) < 
uo, p > is a qkword for the binary shift o corresponding to the bitstream 
a and uo, ui,--.. are the generators for o. We shall refer to a pair (p, a) 
satisfying (2), (22), (47) as a binary pair which meeting at k. 


For the remainder of the section we assume that n > 2k — 2. For such 
n we shall show that there are at least (2”~! — 2"—*)2*-? binary pairs in 
BP(n,k). Note first that if (p,a) € BP(n,k) if and only if the coefficients 
of p(x) = co +a x+--:+cnx” satisfy the following linear system, for some 
choice of elements Io, 11,...,l,4-2 in F. 


ay,Co + aoc, + Q1Cg°°"° + An—3Cn—2 + An—2Cn—-1 + An—1Cn = ly 


a2Co + @1C1 + A0C2 +++ + Gn—aCn—2 + An—3Cn—1 + In—2Cn = lo 


GC “ OR—3Ci + Op—aey * © + +f Oy—7PCj—9 FF On—b Cn —i F On—Rioc, = leo 
Ak—109 + Ap—2C1 + Gp—3C2°+* + An—k—-1Cn—2 + An—kCn—1 + On—k4+1Cn = 1 
GpCo + Gp—101 + Op—909 + ** + On—K—2Cn—2 + Gn—K—10n—1 T+ On—KCn = 0 


Ap+41Cp + AKC1 + Ap—1€2 +++ + An—k—3Cn—2 + An—k—2Cn—1 + An—k-1€n = 0 


(3.3) 


Theorem 7.13. Let k > 2 be an integer. Then for all integers n > 2k — 2, 
BPO) = 2° * —2°*) - 9°, 


Proof. We divide the proof into cases depending on the parity of n. The 
proof is obtained as an application of the unimodality properties obtained in 
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[13] (see also [5, 22]) for the nullity sequence {v(A,,) : m € N} correspond- 
ing to the m x m matrices associated with any bitstream a = {a; : j € ZT}. 

Suppose first that n is odd. Let a = aj,a2,... be a bitstream for which 
Ay-_1 is invertible. If there is a polynomial p for which (3.3) holds, then 
since Co = Cn = 1 the first n — 1 equations of the system above may be 
rewritten as 


a +dn-1th 
a2 + Gn—2 + le 
ao ay ag ag... aAn-2 Ci a3 + Q@n—3 + ls 
ay ao a, a2... An-3 C2 : 
a2 ay ago @1 .-- Gn—4] . C3 = | @p—2 + Qn—K+2 + le-2 
Qp-1 + Qn—p41 +1 
bie Ce 6 Cd toe 4s ao Cn-1 Ak + An—k 
An-1 + ay 


Since A,_1 is assumed to be invertible, the matrix equation above has 
a solution for any choice of elements a,—1,lo,11,...,J—2. Also note that 
once a; through a,_; have been chosen, as well as /; through /,_2, there 
is one and only one choice for each of the remaining entries an,@n+41,.-. 
in order to satisfy (3.3). Recalling from [5], Corollary 2.10, that there are 
2”-3 choices of a1,...,@n—2 such that A,_1 is invertible, we see that there 
are 2?—3 . Qk-1 — 9n—2 . 9k-2 binary pairs (p,a) corresponding to the case 
when A,,_; is invertible. 

More generally let s be a fixed integer such that 1 < s < k—1. By [5], 
Theorem 2.9, it is possible to find a string of elements a1, d2,...,@n—s5 of F 
such that the matrix A,_(2s—1) is invertible and that v(A,_(2s-1)+») = 6 
for 0 < b < s. (Note that the case considered in the preceding paragraph 
corresponds to the case s = 1 here.) Momentarily deleting the first s — 1 
equations of the system (3.3) above we see that the next n—2s+1 equations 
may be written in the form 


ao ay ag a3... An—2s Cs 
ay ag a4 a2 ..+ An—2s—1 Cs+1 
ag ay ao A, ... An—2s—2} . | Cs+2 


An—-—2s An—2s—1 An—2s—2-++--> ao Cn—s 
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QgCo +t G@g-1C, Fees + 165-1 
Grp +  Ggeyp PsP aeg—7 


_ | @k-2Co + Ak—-3C1 +++ + Qk—s—1Cs-1 
QparCo + Gyo, Fes+ - Opus lgn1 
QkCo + Qk-1C1 +++ + Ak—s41Cs—1 
An—sCoO + An—s—1€1 5 phd An—2s—1€s—1 
An—2s+1€n—s+1 ap tem =e An—sCn bs 
An—2sCn—s+1 Saree An—s—1€n bead 
a An—s—k+3Cn—st+l1 Se An—k+2Cn a I, 
An—s—k+2Cn—s+1 1 *** + Q@n—k+1Cn 1 
An—s—k+1Cn—s+1 22 Se An—kCn 0 
a1Cn—st+1 +e =p AsCn 0 


To count the number of binary pairs (p, a) arising from this case we recall 
the following facts from [5]. By [5], Corollary 2.10, the number of choices of 
€1,42,...,An—2s for which A, (25-1) is invertible is 2”~?*—+. By [5], The- 
orem 2.7, since v(A,_(25-1)) = 0, then v(An—2s42) = 1 regardless of the 
choice of @,—2541- This latter condition follows from the phenomenon that 
the sequence {v(A;) : 7 € N} is unimodal, i-e., it is the concatenation of 
strings of positive integers of the form 1,2,...,m—1,m,m-—1,...,2,1,0. 
By the proof of Theorem 2.9 of [5] there is one and only one choice 
for the entries dn—25+2, An—2s+3;---;@n—s SO that v(An—2s4145) = 5 for 
2 < b < s—1. Observe from the matrix equation above that the co- 
efficients C1,...,€s—1,Cn—s+1;---;Cn—1 may be chosen arbitrarily, as may 
lg,---,lg-2. Once the strings a; through aj_,, c, through ¢s—-1, Cp—s41 
through c,—; and |, through J,_2 have been chosen, it follows by consid- 
ering all equations of (3.3) subsequent to the first n — s equations that the 
entrieS Gn—s+1,4n—s+2,--- are all uniquely determined. It then follows by 
examining the first s — 1 equations of the system (3.3) that |, through /,—1 
are all uniquely determined. Counting our choices for the entries above, the 
number of solutions to the system (3.3) for which v(A;_(2s—1)4») = 6 for 
(= b= gare Oo 2-1. 9).09) 4th OF gg Diet. OF, 
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We show that no binary pair (p,a) can arise from the construction 
above for more than one choice of s € {1,...,4 —1}. For suppose 1 < s’ < 
s < k—1 and (p,a) is obtained for both indices s and s’. Recall from the 
construction above corresponding to s that v(An—2541) = 0,Vv(An—2s42) = 
1,...,V(An—s41) = 8. It then follows from the unimodality properties of 
the nullity sequence {v(A;) : 7 € N} (see Theorem 7.8) that for n—2s+1< 
t<n—-1, v(A;) > 0, ie., that A; is not invertible. Hence v(A;) > 0 for 
n—2s+2<j<n-—1. Since (p,a) arises from the analysis for the integer 
s’, however, we have v(Ay_25/41) = 0. But n—25+1<n—2s'+1<n-1 
so V(An_—25’41) > 0. This contradiction yields the result. Adding all of the 
binary pairs obtained from each of the cases for s, 1 << s <n-—1 we get 


k-1 
». gn—s—1 . gk—2 = (~ _ ots : gk-2 
s=1 


distinct binary pairs. 

We sketch the proof for the case when n is even. For a fixed positive 
integer s € {1,2,...,k4 — 1} consider finite sequences ao, @1,..-,@n—s—1 Of 
elements of F' for which the corresponding sequence of Toeplitz matrices 
A;,j € {1,2,...,n — 8} satisfies v(An_os40) = 0,0 < b < s. An analysis 
similar to the one above shows that we obtain at least 2”~?° - 2*-? binary 
pairs which meet at k for each s, and therefore we have (2"~!—2”~*) . 2k? 
binary pairs which meet at k. L] 


Corollary 7.3. Let k > 2 be an integer. Then 


G) BPO,k) =2*-2. 

(ii) BP(n,k) =2"-1.2'-? ffl <n < 2k —-2. 
(in) BP(n,k) = (Q"-* = 2°-***) . 2°? an = 2k — 9, 
Gy) BPW) =O" —2°-*) . 2°? afin > 2k —2. 


Proof. The first three equations are obtained by combining the results 
of Lemmas 7.4 and 7.5. So suppose n > 2k — 2. By Theorem 7.12 there 
are 2”-* polynomials of degree n > 2k — 2 with constant coefficient 1 
having a reciprocal factor of degree > 2k — 2. By Lemma 7.5 none of 
these polynomials meets with any bitstream at the integer k. Therefore, 
among all polynomials of degree n with constant coefficient 1, there are 
at most 2”—! — 2”-* which meet some bitstream at k. By Theorem 7.10 
each such polynomial meets at k with at most 2*~? distinct bitstreams. 
Hence BP(n,k) < (2"7! — 2”-*) . 2*-?. But by the preceding theorem, 
BP (n,k) > 2 —28-").. 9°, Oo 
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The following result follows as a corollary to the proof of the preceding 
corollary. 


Corollary 7.4. The following are equivalent for any polynomial p(x) with 
constant coefficient 1 over GF(2), and any integer k > 2. 


(i) p(x) meets at least one bitstream at k. 
(ii) p(x) meets exactly 2*-? bitstreams at k. 
(iii) p(x) has no reciprocal factors of degree > 2k — 2. 


7.5. Conjugacy Classes of Binary Shifts 


As a consequence of the preceding results we are now in a position to estab- 
lish a correspondence between the conjugacy classes of binary shifts of finite 
commutant index and the family of polynomials over GF'(2) with constant 
coefficient 1. Specifically we provide an algorithm which can be used to 
compute, for any polynomial p(a) over GF(2) with constant coefficient 1, 
and any integer k > 2, the number of binary shifts o of commutant index k 
associated with p(a) in the sense that w =< uo,p > generates o*(R)’N R, 
(where u;,j7 € Z* are the generators for o. In Corollary 7.4 it is shown 
that for a fixed index k > 2, any polynomial p(x) meets either 2*~? bit- 
streams at k or it meets no bitstreams at k. In terms of binary shifts, using 
Remark 7.4, this means that for a polynomial p(x) there are either 2*~? 
binary shifts 0 for which < ug,p > is a qkword, or no such binary shifts. 
Note, however, that if w =< uo, p > is a qkword for some binary shift o it 
is not necessarily the case that o has commutant index k. As an elemen- 
tary example consider the polynomial p(x) = « + 1 and the binary shift o 
with bitstream {0,1,0,0,0,...}. Then < uo,p >= uour is a q3word. On 
the other hand, o is a binary shift of commutant index 2, with the word 
ug generating the relative commutant algebra o?(R)’M R. What is needed, 
therefore, is a way to determine how many of the bitstreams a which meet 
p(x) at k actually correspond to binary shifts of commutant index k. The 
following three results provide the key. 


Theorem 7.14. Let a be an eventually periodic but not mirror-periodic 
bitstream, t.e., the reflected bitstream a is not periodic. Let a, with gen- 
erators {u; : j € Zt}, be the binary shift on R corresponding to a. Let 
k € {2,3,...} be the commutant index of o. Then if p(x) is such that the 
word w =< uo,p > generates o*(R)'N R, then 


(i) (p,a) is a binary pair, 
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(ii) (p,a) meets at the integer k, 

(iii) of f(a) is a polynomial with constant coefficient 1 then (pf,a) meets 
at k+deg(f). 

(iv) if (g,a) is a binary pair for some polynomial g with constant coefficient 
1, then p is a factor of g and (g,a) meets at k + deg(g/p). 


Proof. First note that o is indeed a binary shift on R, since a is not 
periodic, Theorem 7.1. Also, o has finite commutant index, since a is 
eventually periodic, Theorem 7.4. Let p(x) = co +aiu +--+ + cnx”, then 
w = ug’uy'--- uc. Since w anticommutes with uz; and commutes with 
Uk, Uk+1,---, the infinite linear system (2.1) is satisfied. Hence (p,a) is a 
binary pair meeting at the integer k. This proves (7) and (77). To see (77) 
note that if f(7) =Ip that---+lyx2™ with Ip = 1 = 1, then by (1.3), 
< uo, pf >= + << uo,p>,f >=+<wu,f >= tw"o(w) ---o™(w)'. 
It follows that < uo, pf > anticommutes with uz_14m and commutes with 
Uk+m, Uk+m+1,--- Whence (777). 

To see (iv) let y =< uo,g >. Since (g,a) is a binary pair there is 
an integer ky where they meet. It follows that the word y anticommutes 
with uz,-1 and commutes with uz,, Uz 41,---, Le, y ¢ o1(R)' OR 
but y € o*(R)' A R. Since o has commutant index k then k < ko 
and y ¢ o*—!(R)'N R, y € o(R)'OR = {w,a(w),...,0°-*(w)}". 
We conclude that there is a polynomial h(x) of degree kg — k and 
constant coefficient 1 such that y= + < w,h >. But then < ug,g >= y= 
<b SH ck Se 1p.) Sh b= < Wo, h-S, 80 Pie)\h(e) = o(@). Hence 
p(a) is a factor of g(a). That (g,a) meet at k + deg(g/p) now follows from 
(iit). O 

As an immediate corollary we have the following. 


Corollary 7.5. Suppose o is a binary shift on the hyperfinite II, factor R 
with corresponding bitstream a = {do,a1,...}. Suppose there are an integer 
ko and a polynomial g(x) € Fax], with constant coefficient 1, such that g is 
paired at kg witha. Then ao has finite commutant index. In particular there 
is a unique polynomial p with constant coefficient 1 such that p and a are 
paired at k, the commutant index of a. Moreover, 


(i) p is a factor of g, and 
(ii) if the polynomial g/p has degree s, thenk +s = kg. 


Corollary 7.6. Suppose o is a binary shift on R with finite commutant in- 
dex k and corresponding bitstream a. Then for s > 0 there are exactly 2°~1 
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binary pairs (g,a) which meet at the integer k +s. Each such polynomial 
g has the form g(x) = p(x)f(«) where p(x) is the unique polynomial 
which meets a at k, and f(x) is a polynomial with constant coefficient 1 of 
degree s. 


Proof. The result follows from the preceding result and the fact that 
there are exactly 2°—! distinct polynomials of degree s with constant coef- 
ficient 1. Oo 


Definition 7.8. Let p(x) be a polynomial with constant coefficient 1 in 
GF°(2). For any integer k > 2 let C(p,k) denote the family of binary shifts 
o of commutant index k on R for which the word w =< ug,p > in the 
generators of a0 generates the first nontrivial relative commutant algebra 
o*(R)' OR. 


Remark 7.6. 


(i) Restating Theorem 7.6 (see also Remark 7.4 in terms of this nota- 
tion, we have C(p,2) = @ if p(a) has any reciprocal divisors of degree 
exceeding 1 and |C(p, 2)| = 1, otherwise. 

(ii) Since there are no qlwords, by Remark 7.3, C(p,1) = 0. 

(iii) Let p(x) = 1. Then C(p,k), for k > 2, consists of all binary shifts of 
commutant index k for which the word w =< uo,p >= ug generates 
o*(R)' OR. It is not difficult to show that these are the binary shifts 
each of whose bitstreams a = {ao,qa1,...} satisfies a,_1 = lay = 
Qk41 =-+:: =0. Note that, as a1, a2,...,@%—2 may be chosen arbitrar- 
ily, there are 2*~? such binary shifts, i.e., |C(p, k)| = 2"~?. 


The following result gives a recursive formula for computing the cardi- 
nality of C(p, k). 


Theorem 7.15. Let k > 2 be a fixed integer. Let p(x) be a polynomial of 
degree n. If p(x) has a reciprocal factor r(x) with deg(r(x)) > 2k —2 then 
C(p,k) =@. Otherwise, for each j = 0,1,...,n2—1 let 951, Gj2,.--,4jm,; be 
the distinct factors of p(x) of degree j. Then 


|C(p, k)| = 2°? — », IC(qi,k — (n— 9))| (4.1) 


max({0,2+n—k})<j<n-1 
1<i<m; 


Proof. If p(x) = 1 statement (iii) of the remark indicates that |C(p, k)| = 
2-2 It is clear that the summation in this case is 0 and the formula holds 
in this situation. If k = 2 and p(x) has no reciprocal factors of degree 
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> 1 then by statement (iii) the formula should be 1. Since C(p,1) = @ by 
statement (ii) the formula holds in this case. So we may assume that p(x) 
has degree > 1 and that k > 3. Let p(x) be a polynomial of degree n with 
constant coefficient 1. By Lemma 7.5 (see also Remark 7.4) C(p,k) = 0 if 
p(x) has a reciprocal factor of degree > 2k — 2, so we may assume that the 
maximal reciprocal factor of p(a) has degree < 2k — 2. Suppose |C(q,1)| is 
known, for all polynomials g(a) of degree < n and alll € {2,3,...,k — 1}. 
Suppose a is a bitstream which meets p(x) at k. Let o be the corresponding 
binary shift on R. Then either o € C(p,k) or by Corollary 7.5 there is an 
le {2,...,4 —1} such that o has commutant index | < k. In the latter 
case there is a unique polynomial q(x) for which o € C(q,l). Suppose 
deg(q) = 7 < n—1. By Corollary 7.5 q(x) is a proper factor of p(a) and 
k = 1+deg(p/q) = !+n—j,sol = k—(n—j). Since C(q,/) = 0 unless / > 2,, 
we must have k — (n —j) > 2, or j > 2+ (n—k). Of course 7 > 0 also. On 
the other hand, q(x) is a proper factor of p(x), so 7 = deg(q) < deg(p) =n, 
hence maz({0,2 + (n — k)}) < 7 < n—1. Hence every binary shift o of 
commutant index less than k, for which < uo,p > is a qkword (where 
{u; : j € Z*} are the generators of o) is accounted for in the summation 
in the formula above. 

Conversely, suppose q(x) is a proper factor of p(x), and suppose 
a € C(q,k — deg(p/q) = C(p,k — n+ deg(q)). Let a be the bitstream 
corresponding to o. Then by Corollary 7.4 and Theorem 7.14, a is one of 
the 2*~? bitstreams which meet p(x) at k. Hence the summation in the 
formula subtracts from the 2*~? bitstreams corresponding to p(x) any bit- 
stream associated with a binary shift o of commutant index < k for which 
< uo,p > is a qkword. Hence the right side of the formula above counts all 
binary shifts o of commutant index equal to k for which < uo, p > generates 
oh) TR: O 


Corollary 7.7. There are countably many conjugacy classes of binary 
shifts of any finite commutant index. 


Corollary 7.8. Let p(x) be an irreducible polynomial over GF(2) of degree 
n>1. Let k > 2 be an integer. If p(x) is reciprocal then 


(i) 
(ii) 
(iii) 


C(p,k)| = 2*-? ifk-—1<n<2k—-2, and 


C(p,k)| =0 ifn > 2k — 2, 
C(p, k)| = 2*-2 — 2'-"-2 Ff#O<n<k-2. 


134 G. L. Price 


If p(x) is not reciprocal then 


k-2 ifn >k—1, and 
aS Pe OS ey = =D: 


=e 
2 
Proof. (i) follows immediately from the first assertion of the theorem. 
Otherwise, since 1 is the only proper factor of p(a) the formula in the 
theorem reduces to |C(p,k)| = 2*~? — |C(1,k —n)|. If n > k —1 then 
|C(1,k — n)| = 0, so (ii) and (iv) follow. If 0 <n < k—2 then |C(1,k — 
n)| = 2*-"~? by (iii) of the remark preceding the theorem, and |C(p, k)| = 
2k-2 _ 1C(1,k —n)| = 2-2 — 2'-"-?. giving (iii) and (v). O 


What follows is an algorithm for determining the bitstreams of those 
binary shifts which lie in C(p,k), for any polynomial p(a) over GF (2) 
with constant coefficient 1. If p(w) = 1 then C(p,k), by Remark 
7.6(iii), consists of binary shifts whose bitstreams are of the form a = 
{0, a1, @2,...,@—2,1,0,0,...}. Suppose p(x) has degree n > 0 and sup- 
pose k > 2. Suppose moreover that p(a) has no reciprocal factors of degree 
> 2k —2. Assuming the bitstreams for all binary shifts in C(q,1), deg(q(x)) 
<n,l<n-—1, have been determined we find the bitstreams associated with 
the binary shifts in C(p, k). 

To do this we first seek any bitstream a which meets p(x) at k. If q(x) 
is a factor of p(x) such that C(q,k — deg(p/q)) 4 0 then by Theorem 7.14 
the bitstream associated with a binary shift in this set meets p(x) at k. 
If no such factor q(a) exists we find a as follows. If deg(p(a)) < 2k — 2 
then we may obtain a as in the proof of Lemma 7.4. If deg(p(x)) > 2k — 2 
then we can find a bitstream a by solving the system consisting of the first 
n+1 equations in the infinite system (3.1). Having done that, the remaining 
elements {@n—K,@n—k+1,--- } are obtained from the remaining equations in 
(3.1) using the fact that co = 1. For the next step, let s@),s®),...,s™ 
be the bitstreams as in the proof of Theorem 7.10. For 1 < 7 < m let 
bY = a+s'™. (From that theorem it was determined that m < 2*~?. From 
the computation of BP(n,k) in Corollaries 7.3 and 7.4, however, it turns 
out that m = 2*~?.) By the theorem we have found all of the bitstreams 
which meet p(x) at k. Note that all of these bitstreams correspond to binary 
shifts in C(p,k). For if not, the proof of the theorem above implies that 
one of these bitstreams meets with some factor q(x) of p(x) at the integer 
k; — deg(p/q), a case which we have ruled out. 

Since a bitstream is a complete conjugacy invariant for binary shifts on 
R, the procedure above leads to the following. 
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Theorem 7.16. The algorithm above gives a complete classification of the 


binary shifts of finite commutant index up to conjugacy. 
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8.1. Introduction and Preliminaries 


Combinatorial Designs first appeared in the medieval literature mainly as 
puzzles. We will dwell on this theme somewhat later during the course of 
this article. The medieval forerunner of design theory is thus the famous 
Euler conjecture, disproved by Bose, Shrikhande and Parker. In the twen- 
tieth century, Combinatorial designs, mainly in the form of constructions 
were first looked at by Statisticians. Specifically a designs or a 2-design 
has the following definition. By an incidence structure, we mean a triple, 
consisting of the set of points, the set of blocks and an incidence relation 
between the two sets. It is common to identify a block with the set of points 
incident to it; but if there are repeated blocks, then the blocks form a mul- 
tiset of the sets of points. An incidence structure is called a 2-design (or 
a balanced incomplete block design) if every block has a constant size k, 
which is strictly less than v, the number of points, and any two points occur 
together (are commonly contained in) the same number X of blocks. If b 
the number of blocks and r the number of blocks containing a given point, 
then such a configuration is called a (v,b,r,k, A)-design or sometimes also 
called a (vu, k, \)-design. 

A two-way counting produces following two equations on the parameters 
of a design D. 

A(v — 1) = r(k -1) 
ur = bk. 
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If we look at the (point-block) incidence matrix N of a design D, then 
it is easy to see that the rows of N are linearly independent. Since N is a 
v X b matrix, we see that 


Theorem 8.1. (Fisher’s inequality) We always have v <_ b, i.e., the 
number of points is less than or equal to the number of blocks. Further if 
D is a design with v = b, the incidence structure in which blocks are points 
and points are blocks is also a design with the same parameter set (v,k, A). 


Definition 8.1. A design with v = 6 is called a symmetric design. 


2-designs were called Balanced Incomplete Block Designs (BIBDs) by 
statisticians and were used by them in order to get rid of “two-way hetero- 
geneity”. For a statistician, the mathematician balance, which just means 
that every pair of points is contained in exactly A, a constant number of 
blocks, translates into statistical balance which means that all the elemen- 
tary treatment contrasts are estimated with the same variance. 

If k = 3 and A = 1, then the necessary conditions yield v = 2r + 1 and 
hence v is odd. Also the second condition says that 3 divides r(2r +1) and 
hence v has the form 6m+ 1 or 6m+3. Such designs, with k = 3 are called 
Steiner triple systems. Fano plane is the smallest example of a Steiner triple 
system. Through recursive constructions, it is not very difficult to show that 
for all orders v = 1,3 (mod6), a Steiner triple system of order v exists. In 
the year 1847, Kirkman, who was a minister with the Church of England, 
posed the following question: 15 schoolgirls are to be arranged in batches of 
three girls each for seven days of the week such that every two of them are 
together in the same batch in exactly one batch on some day and every day 
we have 5 batches of girls. This amounts to constructing a Steiner triple 
system on 15 points and 5 x 7 = 35 blocks (or lines) such that the system 
is “resolvable”. The general problem of constructing a resolvable Steiner 
triple system for every order v of the form 6m + 3 was solved in 1970s by 
Ray-Chaudhuri and Wilson. 


8.2. Symmetric Designs 


Let us have a second look at Fisher inequality and the Symmetric Designs. 
If N denotes the point-block incidence matrix of such a design D, then the 
design properties translate into the following matrix equations: 


NNt=(r—A)I-AJ, IN=kJ, NJ =rJ 
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where J is the identity matrix and J denotes the all 1-matrix (of appropriate 
order). If v = b, then the matrix N is square and non-singular and hence 
multiplication of the first equation by N~' on the left and N on the right 
(note that r = k, in this case) obtains: 


N*N = (k—-A)I+AN 1IN 


But NJ = kJ and hence N-!J = k~1J and we thus get: N'N = 
(k — A) AJ. An interpretation of this is the following: 


Theorem 8.2. [f D is a symmetric design, the block-point incidence struc- 
ture D* called the (combinatorial) dual of D is also a symmetric design with 
the same parameters v,k and X. 


Let H denote a Hadamard matrix of order 4t. This is a matrix of order 
4t with entries +1 such that any two rows of H are orthogonal. Multiplying 
rows/columns by —1 does not change the Hadamard property and hence 
we can assume w.l.o.g. that H is in a standard form. That is the first row 
and the first column of H consist only of +1’s. Then deleting the first row 
and the first column of H and changing —1’s to zeroes, we get an incidence 
matrix of a (v,k,A) = (4t—1, 2t—1,t—1) symmetric design which is called 
a Hadamard symmetric design. This procedure is reversible and hence the 
existence of a Hadamard matrix and a Hadamard symmetric design go hand 
in hand. 

We look at a projective space of dimension 2 over GF(q) and declare 
its points as points and its lines as blocks of a symmetric design. This is 
called a projective plane and has parameters (q? + q+ 1,q+1,1). This 
construction works for every prime power q. More generally, we may take 
an n-dimensional projective space over GF(q) and declare its points as 
points and its hyperplanes as blocks of a symmetric design. This is called 
a projective design and has parameters 


m+1_o4 
go 8 ee ae 
q=1 g=1 q=1 


In the special case, when the projective space has dimension 2, the 
corresponding symmetric designs have A = 1. Symmetric designs of this 
kind with \ = 1 (that is, with parameters v,k,\) = (q? +q¢+1,q+1,1)) 
are defined to be projective planes of order q. 
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Construction: Let 7 be an affine plane of order q with parallel classes of 
lines 7, 72,.--,7%q+1- Let the lines of a-th parallel class 7, be denoted by 
lat, t=1, 2,...,q. Let the points of 7 be co, c1,..., Cg2-1. For 7a, define a 
matrix A of order q? indexed by the points of 7 as follows. A = [aj;], aij = 1 
iff 7, 7 € lat for some t and a;; = 0 otherwise. If we denote this A by Ag 
(for the a th parallel class) then, 


e A, is asymmetric matrix. 


cape BG 
allie J ifa’4a 


et A =qi t+ J. 


Now let L = [Lag],a,2 =0, 1, 2,...,q+1 denote a Latin square on the 
symbol-set {0, 1, 2,...,¢+1}. Replace Lag by the matrix A, if Lag =a 
and replace Lag by 0 (a zero matrix of order q?) if Lag = 0. Then let the 
resulting matrix be M = [Magl,a,@ =0, 1, 2,...,q+1. Then (1), (2) and 
(3) imply that MM? = q?I +qJ proving that M is a incidence matrix of a 
symmetric design with parameters 


v=q(qt+2), k=q(q4+1), A= (+) 


Also, the construction (see [5, 6]) shows that we have a point-partition 
in which every point class (this corresponds to the juxtaposed q? x (q? +2q7) 
matrix [Mg] where ( takes all values from 0 to q+ 1) is an affine plane 
of order q (repeated q times). Hence MW is an incidence matrix of a quasi- 
affine design (defined later in this exposition). This family of designs is 
important for several other reasons. The first construction of such designs 
was obtained by Ahrens and Szekeres and these designs naturally give rise 
to strongly regular graphs. Finally, the constructions of Ahrens and Szekeres 
also connects these objects with generalized quadrangles. We will get back 
to this theme a bit later in this exposition. 

For the reasons of structural symmetry and better connections with 
group theory, symmetric designs are objects of considerable interest. Many 
other constructions of symmetric designs are known. The existence ques- 
tion for Symmetric designs is the question of constructing a (0, 1)-matrix 
satisfying the matrix equations given above. Algebraic number theory has 
been employed in order to answer this existence question and the relevant 
seminal result is called the Bruck-Ryser-Chowla theorem. Unfortunately, 
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it works only in one direction. That is, it provides us with only a necessary 
condition which, may not be sufficient. For example, it is not known whether 
there is a projective plane of order twelve but it is known, thanks to the 
Bruck-Ryser-Chowla theorem that 


Theorem 8.3. Jf q is the order of a_ projective plane such that 
q = 1,2 (mod 4) then q is a sum of two integer squares. In particular, 
there are no projective planes of orders q such that q = 6 (mod 8). 


An extensive search ([13]) for almost 200 hours on the fastest CRAY 
computer available then proved in the late 1980s that there is no projective 
plane of order ten. 

In a recent seminal paper on this difficult problem of constructions of 
symmetric designs, Ionin ([9]) has obtained an elegant general method that 
yields about ten different families of symmetric designs not known until 
three years ago. Improving very strongly on the methods of Dinesh Ra- 
jkunlia where the use of weighing matrices and designs was first made, 
Ionin’s constructions have far reaching consequences. Some simplifications 
of Ionin constructions are published in the last couple of years and the au- 
thor also expects to write out some simplification of Ionin constructions. 
We give below one of the theorems of Ionin. 


Theorem 8.4. There exist symmetric designs D with parameters: 


(q+1)?"-1 
q+2 


(q+1)?"""(q +2) 


,k=(q+1)™", A= 5 


v=1+2(q¢+1) 
where q = 2? —1 is a Mersene prime and m is a positive integer. 


About 23 infinite families of symmetric designs are known. However, all 
the known examples of symmetric designs seem to be only of the following 
types: 


e When \ = 1, we have a projective plane of order q with parameters 
(q7+q+1,q+1,1). These exist for every prime power g. No other 
examples are known. 

e When \ = 2, we have a biplane with parameters ((5) +1,k,2). These 
are known to exist for the following values of k: 3,4,5,6,9, 11,13. No 
other examples are known. 

e When A = 3, all the known examples have k bounded by 15. 

e When \ > 4, all the known examples have k bounded by d? +X. 
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The known situation led to the following informal conjecture attributed 
to M. Hall. 


Conjecture 8.1. 


(a) VA > 2, there exist only finitely many symmetric (v, k, d). 
(b) Stronger form: VA > 4, the parameters of a symmetric design satisfy 
k<d? +X. 


Symmetric designs with parameters (v,k,\) where k equals q? + q and 
A equals q are also related to the famous design extension theorem of 
Cameron. We will refrain from discussing the actual statement of Cameron 
extension theorem but will discuss the original problem of permutation 
group extensions from which the design extension problem arose. Let (G, X ) 
denote a permutation group. We call the action of G on X transitive, if 
given any points x and y in X there is some a in G such that a(x) = y. 
In general, G is said to be t-transitive on X if given any t-tuples of points 
X say (%1,%2,...,%4) and (y1, y2,---, Yt), we have some a in G such that 
a(x;) = y; for alli = 1,2,...,¢. Further, this action is sharp (i.e. G is sharp 
t-transitive) if such an a is unique. Trivial examples of sharp n-transitive 
actions are the symmetric groups S,,. Jordan proved in the last century that 
no non-trivial sharp t-transitive group exists fort > 6 and fort = 4,5 
the non-trivial sharp 4 and 5-transitive groups are precisely the Mathieu 
groups Mj; and Mj. that act on sets of orders 11 and 12 respectively. We 
call a symmetric design D with parameters (v,k, A) extendable if there is 
a 3-design D’ with parameters v’ = v + 1,k’ =k +1, = XA such that if 
we look at all the blocks containing a single point oo of D’, then the re- 
sulting incidence structure is D. The symmetric design extension theorem 
of Cameron is the following. 


Theorem 8.5. Let D be a symmetric 2-design that has an extension (to a 
3-design). Then D has one of the following as its parameters. 


(1) D is a Hadamard 2-design. 

(2) v= (At 2)(A? +442) andk = 2? 4+ 3A 41. 

(3) (v,k, A) = (495, 39, 3). 

(4) D is a projective plane of order ten and hence is a 2-design with 
(w, dA) = (111, 11,1), 


The first type in this list contains all the Hadamard symmetric de- 
signs and extensions exists for all the Hadamard symmetric designs. Other 
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than this infinite class, there is a sporadic 3-design and a parametrically 
possible projective plane of order ten. Besides these, the Cameron classifi- 
cation theorem also The second type is a putative infinite list of extendable 
symmetric designs, the first object of which is the projective plane of order 
four. This object does have an extension, not just to a 3-design but even to 
4 and 5-designs. These extensions, called the Witt designs, are intimately 
connected with the Mathieu groups M22, M23 and M24. The next parame- 
ter set in this list is a symmetric design with (v,k, A) = (56, 11,2). As we 
already saw in the discussion on M. Hall’s conjecture, none of the designs 
of the second type with A > 3 has been constructed so far and nor has any 
one shown that designs with such parameters cannot exist. (the question 
of extension naturally makes sense only after that). The same remark also 
applies to a design of the third type with (v,k, A) = (495, 39,3) whose ex- 
istence is an open question. Fortunately, thanks to the result of Lam cited 
earlier, the fourth type cannot arise because there is no projective plane 
of order ten (though this was not known at the time Cameron proved his 
theorem). 

We briefly deal with the question of non-isomorphism. An isomorphism 
@ between two symmetric designs D, and Dz is a pair (¢’, 6”) of two per- 
mutations, the first from the point-set of D, to the point-set of Dg and 
the second from the block-set of D, to the block-set of De such that the 
incidence is preserved. When D,; equals D2 this is called an automorphism 
and it is easily seen that the set of automorphisms of a symmetric design 
form a group. This is one way in which we can actually construct a sym- 
metric design from a group. This is called the method of difference set. 
For example developing the initial block (0, 1,3) modulo 7 gives us all the 
7 blocks of the Fano plane. The problem of determining whether or not 
two given symmetric designs are isomorphic is essentially a uniform hyper- 
graph isomorphism problem (hypergraphs are objects for which edges are 
subsets of size greater than 2). This problem is evidently at least as hard 
as the corresponding problem of graph isomorphism which is known to be 
an NP-complete problem. 

It is on this background that producing a large number of non- 
isomorphic symmetric designs (with the same parameters) is a pertinent 
question. This question was first handled in the case of Hadamard symmet- 
ric designs by Bhat-Nayak, Shrikhande and Singhi. For the case of projec- 
tive designs (the point-hyperplane incidence structure), this was done first 
by Kantor and later by Jungnickel. In both the cases, the lower bounds 
are asymptotically exponential. A recent result establishes the same type of 
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result for designs of Ahrens-Szekeres type, i.e., those designs with k = \?+). 
Among various lower bounds proved in that paper of Gharge and Sane ((6]), 
one is in terms of Catalan numbers and one is in terms of special partitions 
of integers. Since both these numbers are exponential, the established lower 
bound is also exponential. We conclude by pointing that there is a marked 
difference between techniques employed by the earlier authors (Shrikhande, 
Singhi, Jungnickel) and those in the recent paper. In the former situation, 
techniques are recursive while in the latter situation they are direct and 
depend on graph isomorphisms as well as partition functions in number 
theory. This forms a part of the recent Ph.D. thesis of Gharge. Specifi- 
cally, call a symmetric design of the Ahrens-Szekeres type, that is one with 
parmeters 


v=P(q+2), k=q(qt1), A=¢@ (*) 


quasi-affine if its point-set admits a partition into g + 2 point classes each 
with gq? points such that the induced incidence structure in each point 
class is an affine plane of order q (repeated q times). We have already 
come across this special class of symmetric designs with parameters as in 
(*) in Construction in Section 8.2. It was shown by the author that such 
a partition (if it exists) is unique (that is, it is uniquely determined by 
the structure of the given incidence structure. This was exploited in the 
same paper of the author, to show that, upto isomorphism, there are at 
least two non-isomorphic quasi-affine designs for any given prime power q. 
This result was vastly improved recently and two of the main results of 
Sanjeevani Gharge’s Ph.D. thesis ([5]) are the following. 


Theorem 8.6. Let q be a prime power. Then: 


(1) The number of non-isomorphic solutions of quasi-affine designs with 
parameters (*) is at least mv4. 
(2) Let Cy, denote the m-th Catalan number defined by 


Cm = = (7?) 
m\m-—-1 


and let q > 7. Then the number of non-isomorphic quasi-affine designs 


with parameters (*) is at least 2MCm (in both the cases m denotes |$] 


and here |x] denotes the largest integer less than or equal to x). 
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8.3. Strongly Regular Graphs 


Recall that a regular graph I’ of degree k is a simple graph in which every 
vertex has degree k. The graph [’ with n vertices and regular of degree k is 
called a strongly regular graph with parameters (n,k,a,c) if we have two 
more constants a and c such that 


e If x and y are adjacent vertices then the number of vertices commonly 
adjacent to both of them is a. 

e If x and y are non-adjacent vertices then the number of vertices com- 
monly adjacent to both of them is c. 


It is not difficult to see that the complement of a strongly regular graph 
(srg) [ is also strongly regular. In order to make the situation non-trivial 
and interesting, we insist that both I. and its complement are connected 
graphs on at leastn > 4 vertices. In particular, this means that neither T 
nor its complement are complete graphs. Here are some examples. 


(1) A Triangular Graph T,, is defined to be the line graph L(K,,) of a 
complete graph. Equivalently, T,,, has a vertex set that consists of all 
unordered pairs (2-subsets) of the set M = {1,2,...,m} of order m 
with two vertices adjacent iff the corresponding pairs have an element 
in common. The parameters of T,, are: 


((z) ana aa) 


(2) A Lattice Graph (sometimes also called a Latin Square Graph) X L(m) 
is the line graph L(Ky,,) and has the following description. V = M x 
M where M is as in the previous example. Two vertices of X are 
adjacent iff they have the same first or second co-ordinate. X has the 
parameters: 


(m?, 2m — 2,m — 2,2) 


When m # 4, such graphs are uniquely determined by their parameters. 
For m = 4, there is an exceptional graph called the Shrikhande graph. 
This is not isomorphic to Z(4) and in fact has an embedding on a torus 
(which L(4) does not have). Another reason for non-isomorphism is that 
the full automorphism of the Shrikhande graph has order 16 x 12 = 192 
(in fact, a point stabilizer is the dihedral group Dg) while the full group 
of L(4) has order 2 x (4!)?. 
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(3) The Petersen graph P is a graph on 10 vertices and is the complement 
of T5. Its parameters are (10, 3,0, 1). This graph is also a quotient of the 
graph of the 1-skeleton of a regular dodecahedron. Its automorphism is 
the symmetric group Ss. 


Strongly regular graphs were first defined by Bose in 1959 in the context 
of embedding problems of Bruck nets, though, in the form of association 
schemes these objects were introduced earlier in the paper of Bose and 
Nair. These objects have connections with group theory. Specifically, let G 
denote a rank 3 permutation group. Recall that this is a transitive group in 
which the stabilizer G, of any point x has precisely two orbits other than 
the singleton x. It is easily seen that this is independent of the choice of x. 
Equivalently, rank of a transitive permutation group G acting on a set X is 
the number of orbits of X x X (again, one orbit here is the diagonal {(z, x)}). 
If these orbits are symmetric (and this happens if and only if G has even 
order), that is (x,y) and (y,2) are in the same orbit, then we can define a 
graph [ whose edges are the pairs in one of the non-diagonal orbits. This 
graph is strongly regular and the given group G acts as an automorphism 
group of this graph. A prototypical situation where this holds is the group 
G that consists of the set of all affine transformations of a finite field 
given by: x > a2x+b where a and b are elements of K and a is a non-zero 
quadratic residue. To get symmetric orbits, we require that |k'| = 1(mod4). 
The family of graphs obtained through this construction is called the Paley 
family of graphs. 

An adjacency matrix A of a graphT or order n is a binary square matrix 
of order n with entry 1 at (7, 7)-th place iff the i-th and j-th vertices are 
adjacent and 0 otherwise. A is a real symmetric matrix and hence must 
have real eigenvalues. 


A? =kI+aA+c(J —I-A) 
=(k—c)I+(a—c)A+cJ 

and hence we get: 
A* —(a—c)A—(k—oc)I =cJ 


Since I’ is connected and regular of degree k, k is a simple eigenvalue 
of A with a corresponding eigenvector that consists of all 1’s. The other 
eigenvalues can be read off from the above matrix equation and they satisfy 
the quadratic equation: 


e* —(a—c)e—(k—c) =0 (#) 
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Let 6 and r denote the other eigenvalues of A. Then I connected implies 
that these eigenvalues have smaller modulus than k. Further, since the 
product 07 is —(k —c) < 0 as shown by (x) we see that one of these 
numbers is positive and the other is negative, where, by convention, we take 
0 to be positive. It was shown by Shrikhande and Bhagwandas that a regular 
and connected graph T is strongly regular iff it has exactly two eigenvalues 
besides the degree of regularity. Now let A denote the discriminant of (x). 
Then we have 


A = (a—c)? +4(k—c) 
=(0+7)? —46r 
2pt] = (0 — 7)? 
Let mg and m, denote the respective multiplicities of the eigenvalues 0 


and 7. Since the trace of A is 0, we can read off two equations for mg and 
m, in terms of 6 and T: 


metm,-=n-1 


me +m,T = —k 


Theorem 8.7. (Friendship theorem of Erdés and Sés and the 
(v,k,A)-graphs) In a gathering of n people every two persons have ex- 
actly one common friend. The Friendship theorem then asserts that there is 
a person who knows all the rest (and the corresponding graph is the windmill 
graph whose picture is shown below): 
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For a proof by contradiction, we assume there is no one who knows 
all the rest. Then a small argument will prove that every person has the 
same number of friends and hence the graph IT is regular and the given 
conditions imply that it is strongly regular with a = c = 1 and plugging 
these values shows that A equals 2\/k — 1. This gives the values of 6 and 
T to be +2Vk—1 and since mg and m, must be integers, we see that 
(mg —m,)(2VWk — 1) equals —k from which it follows that k —1 is a perfect 
square and Vk —1 divides k which holds only if k = 2, a contradiction. 
This investigation can be generalized to the following: Every two persons 
have the same number \ of friends (here X > 2) What are the possibilities? 
The corresponding strongly regular graphs are called (v, k, A)-graphs ([19]) 
and they give rise a class of symmetric designs. 


Further Investigations of Strongly Regular Graphs: (|7]) 
We could, of course, solve the equations explicitly for 0,7 and mg and 
m, to get: 


a-c+VA | a—e— wh 


6 =  —— 
2 2 
and 

(n-—1)rT+k 
me = 

O—T 
_m-1e+k 

_ @—T 


Alternative expressions for mg and m, are 


Hig) ¢ Bte—Ne-9 


There are two distinct possibilities. In case, mg = m, = m, clearly, we 
must have 2m = n— 1 and also 2k + (n—1)(a—c) = 0 so thata < c 
and ifa < c— 2, then we get 2k > 2(n — 1) which leads tok =n-1la 
contradiction since the graph is not complete. So we must have a= c—1 
and k = m. Finally a two-way counting gives: 


k(k-1—a)=(n-—k-—-l1)c (&) 


(Hold a vertex « fixed and count in two ways all the pairs (y, z) such 
that y is adjacent to x but z is not adjacent to x while y and z themselves 
are adjacent.) 
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Substitution of n = 2m+1 and k = m gives the simplification a + c = 
m — 1 and hence we see that c = } and the parameters are: 


Jad Gas Gas 
(nskrase) = (9 954) 
where gq = 1 (mod 4). This is customarily called the half-case and the Paley 
graphs are examples of the half-case. We have already seen the construc- 
tion of Paley graphs. Paley graphs exhibit a remarkable level of structural 
symmetry. Their automorphism groups are rank 3 groups. The following 
two points should be noted about the half-case strongly regular graphs. 


(1) A strongly regular graph falls into the half-case if and only if has the 
same parameter as its complement. In particular, a strongly regular 
graph which is isomorphic to its complement (a self-complementary 
graph) belongs to this case. 

(2) A necessary condition for the existence of a graph falling in the half-case 
is that n must be a sum of two squares. 


Not the Half-Case; Moore Graphs: 

A regular graph of degree k and diameter 2 has at the most k? + 1 
vertices and in case of equality such a graph is called a Moore graph. Our 
investigation will now include these graphs. In case we are not in the half- 
case, the difference between mg and m, is non-zero and hence it is easily 
seen that 2k + (n —1)(a—c) is a non-zero integer. In order for mg and m, 
to be rational (in fact integers) it is then essential that A is a square of an 
integer and the integer VA must divide 2k + (n—1)(a—c). We record this 
in the following theorem. 


Theorem 8.8. Let T be a strongly regular graph which is not a half-case. 
Then: 


e A =(a—c)*?+4(k—c) is a perfect square say u?. 
e u divides 2k + (n—1)(a—c). 


Moore graphs are characterized by a = 0 and c = 1. Here we neither 
have triangles (because a = 0) nor 4-cycles (because c = 1) but have a 
very large number of pentagons. Here we have u? = 4k — 3. The counting 
equation produces n = k? + 1 and hence u divides k? — 2k = k(k — 2). If 
k = 2 then we have C's; the cycle with 5 vertices. In other cases, u divides 
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ea ©) and hence it follows that u divides 15 giving 1,3,5,15 as the 
only possibilities for u. Here u = 3 corresponds to k = 3 and we have the 
Petersen graph. When u = 5, we get k = 7 and n = 50 and we get the 
Hoffman-Singleton graph. When u = 15, we have k = 57 and the existence 
of a graph in this situation is an open question. 

The question of existence of a strongly regular graph (n, k, a,c) satisfy- 
ing the obvious necessary conditions (*) is wide open. There are some other 
intricate necessary conditions such as the Krein condition. A good source 
of strongly regular graphs, from the point of view of a design theorist, are 
the 2-designs that have \ = 1. For such a design, any two blocks intersect 
in either 0 or 1 point and we can thus declare two blocks to be adjacent 
if they are disjoint. This graph turns out to be strongly regular (the class 
of quasi-symmetric designs studied in the next section is a generalization 
of such designs). Even good characterizations of such specialized strongly 
regular graphs are not available. Designs of this type (those with A = 1) are 
a special case of what Bose called partial geometries but we will not go into 
these things in the present exposition. A special class of partial geometries 
is the class of Bruck nets, which are equivalent to sets of mutually orthog- 
onal Latin squares. Therefore, sets of mutually orthogonal Latin squares 
provide prolific source of examples of strongly regular graphs. 


8.4. Quasi-Symmetric Designs 


Take a design D, not necessarily symmetric. An integer x is called a block 
intersection number of D if we have two blocks X and Y the cardinality of 
whose intersection is «. Which numbers occur as block intersection numbers 
of a design? Thanks to the proof of Fisher’s inequality, we see that D has 
exactly one block intersection number iff it is a symmetric design. 


Definition 8.2. A design D with two block intersection numbers x and 
y (with « <_ y by convention) is called a Quasi-symmetric design. 
Such a design is called a proper quasi-symmetric design if both the block 
intersection numbers occur (and hence D is not a symmetric design). 


Reasons for studying quasi-symmetric designs are many. A mundane 
and practical reason is that symmetric designs are more difficult to study 
(this is not completely true but sometimes believed to be so). On a more 
serious level quasi-symmetric design allows one to construct its block graph 
which in most cases of interest can be shown to be strongly regular. Finally 
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quasi-symmetric designs are connected with combinatorial configurations 


arising out of finite simple groups. Here is a small list of examples. 


(1) 
(2) 


(3) 


As was observed in the last section block designs with \ = 1 are quasi- 
symmetric with (x,y) = (0,1). 

Every n-dimensional affine geometry (with hyperplanes as blocks) over 
GF‘(q) furnishes an example of a quasi-symmetric design. Here v = q” 
and k = q"~! while the intersection pair (a, y) is given by (0,q"~?). 
Let s be a fixed positive integer. Take a symmetric (v, k, )-design D 
and let D’ be obtained from D as follows: D’ has the same point-set 
as that of D. As blocks of D’ we take s copies of all the blocks of 
D (customarily, this is expressed by saying that each block of D is 
repeated s times). Then D’ has parameters 


vy =u, 0 0,7 = sk, Hh =X |) 


This design is easily seen to be quasi-symmetric (there are repeated 
blocks) and we have x = \ and y = k. However, a 2-design D’ with 
parameters as above may actually exist independent of the existence 
of a symmetric (v, k, \)-design. Such a 2-design D’ is called an s-quasi- 
multiple design (of a symmetric (v, k, A)-design), where s is an integer, 
s > 1. Observe also that the existence question of an s-quasi-multiple 
design makes sense irrespective of whether a corresponding symmet- 
ric design exists or not. But this question naturally assumes a lot of 
importance when a corresponding symmetric design does not exist (is 
known not to exist). It is obvious that quasi-multiple designs with par- 
ticular parameters can be constructed using suitable subsets of a finite 
field. For example, a (7,3, 1) symmetric design is constructed by taking 
quadratic residues in the field with 7 elements. Similarly, a (37, 9, 2) 
symmetric design is constructed by taking biquadratic residues in the 
field GF'(37). In general, suppose v = KRY) +1 is such that a GF(v) 
exists (necessarily then v is a prime power). Now suppose u = — is an 
integer. Now consider the “development” of sets of the following kind 
in the sense of C. R. Rao or R. C. Bose: 


iptv 
ar: en ok ae 


where a is a primitive element of the field. Then by taking sufficiently 
many A;’s and their developments, one will have constructed a quasi- 
multiple of a biplane. 
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(4) 
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Delete a block of a biplane (that is a symmetric design with A = 2). 
What we get is a design in which every two blocks intersect in either 1 
or 2 points. We thus get a quasi-symmetric design. Unfortunately, due 
to the fact that only a finite number of biplanes is known as of now, 
this will only produce a finite number of quasi-symmetric designs. The 
famous Hall-Connor embedding theorem shows that such designs can 
always be embedded into a unique biplane by supplying the missing 
block. 

Let D be a point-block incidence structure constructed from PG(n, q) 
a projective geometry of dimension n over a field with q elements by 
taking as points the points of the geometry and as blocks the subspaces 
of codimension two (where n > 3). Parameters of D as a quasi- 
symmetric design are: 


or = on 
There are other classes of examples particularly the affine geometries 
(where x = 0) There is also a classical object called the Witt design 
on 23 points which is associated with the Mathieu group M23 on 23 
letters. 


Define the block graph I of a quasi-symmetric design D by taking as 


vertices of T the blocks of D. Make two vertices adjacent iff the correspond- 


ing blocks intersect in x points. Let N denote the incidence matrix of D 


and A, the adjacency matrix of I’. Recall that we have already established 


the following matrix equations: 


NN‘ =(r—A)I AJ, NtJ=kJ, NJ =rJ 


Here, D is quasi-symmetric and hence the following matrix equation 


connects N*N and A: 


NN =kI +2A+y(J—I-A) 
=(k-y)I+(x@-yAt+yJ 
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Since N’N commutes with J (check this!) it follows that A also com- 
mutes with J which is just the same thing as saying that [ is a regular 
graph. If we can now show that besides the degree of regularity, A has ex- 
actly two other eigenvalues, then [ must be a strongly regular graph. But 
NN‘ and N'N have the set of non-zero eigenvalues. So, N‘N has eigenval- 
ues rk,r — \ and 0. The eigenvalue rk is a simple eigenvalue of N‘N that 
corresponds to the largest simple eigenvalue of A which is also the degree 
of regularity of I. Hence A has exactly two other eigenvalues proving that 
the block graph is strongly regular. 

The connection between quasi-symmetric designs and strongly regular 
graphs has paved way for a lot of recent interesting research work in the 
area of quasi-symmetric designs. Some prominent results in this area in- 
clude the classification of quasi-symmetric designs with z = 0 and those for 
which the strongly regular graph contains no triangles. This study was ini- 
tially undertaken by Baartmans and M. S. Shrikhande. This problem also 
has connections with the design extension theorem of Cameron mentioned 
earlier in this paper. 


Theorem 8.9. ([22]) Let D be a proper quasi-symmetric design with block 
intersection numbers x = 0 and y > 2. Then D has only a finite number of 
possibilities under any one of the following assumptions. 


(1) k is fixed. 
(2) y is fixed. 
(3) is fixed. 


This theme has been greatly explored in a number of recent publica- 
tions of Pawale. It was proved by Pawale and the author ([{17]) as well as 
Calderbank and Morton that: 


Theorem 8.10. Up to isomorphism, the only quasi-symmetric 3-design 
with (x,y) = (1,3) are the unique 4 — (23,7, 1)-design or its block residual. 


A paper of Sane and M. S. Shrikhande ({23]) began the task of classifying 
quasi-symmetric 3-designs. Various finiteness results were proved in that 
paper. It should be noted here that extremely few cases of the existence of a 
quasi-symmetric 3-designs are known, In fact, upto complementation, these 
are just the Witt designs mentioned in the above theorem. Classification 
of quasi-symmetric 3-designs with « = 1, and Pawale’s work in his Ph.D. 
thesis has also completed this for all quasi-symmetric 3-designs with x upto 
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100. Pawale has also obtained a classification of all quasi-symmetric 3- 
designs whose block graph is triangle-free and essentially with the exception 
of the Witt designs these are the Hadamard 3-designs. The classification 
programme of quasi-symmetric 3-designs has an obvious connection with 
what are called tight designs. Specifically, a tight 2s-design is one that 
has s blocks intersection numbers. It was proved by Ray-Chaudhuri and 
Wilson that any 2s-design must have at least s block intersection numbers 
(generalization of Fisher inequality for higher designs). 
Recall that an s-quasi-multiple design D’ has parameters 


y=9,b =o =skhk =bX =A 


Call such design proper if it is not obtained by simply repeating each 
block of a symmetric (v, k, \)-design s times. Jungnickel and Tonchev con- 
sidered proper s-quasi-multiples which are also quasi-symmetric. Let us call 
such a design a special design. Perhaps the existence question of special de- 
signs will shed some light on the connection between symmetric and quasi- 
symmetric designs. However, only one example of a special design seems 
to be known, the unique design with 22 points and 176 blocks which is a 
proper 8-fold quasi-multiple of a non-existent (22, 7,2) symmetric design. 
In that paper of Jungnickel and Tonchev, various results on special designs 
were proved. In particular, the authors proved that no such special designs 
which are quasi-multiples of projective planes exist. the author proved a 
classification result for special designs corresponding to biplanes, extending 
the results in Jungnickel and Tonchev. After partial results by Jungnickel 
and Tonchev ({12]) and Ionin and M. S. Shrikhande, the following result 
was proved by the author ({21)). 


Theorem 8.11. Let D be a s-fold quasi-multiple design with parameters 
v, sv, 8k, k, sX. Let D be also a quasi-symmetric design. Further assume that 
the parameters satisfy (k,(s—1)A) =1 (here (m,n) denotes the g.c.d. of 
m and n). Then D cannot be proper. That is, D must be a multiple of 
a symmetric design. 


Other than purely combinatorial techniques employed in the study of 
symmetric and quasi-symmetric designs, we also have techniques derived 
from linear algebra, number theory and coding theory. This article has dis- 
cussed linear algebra of designs and strongly regular graphs in some detail. 
The number theoretic techniques stem mainly from the Hasse-Minkowski 
theory of quadratic form as applied to a binary matrix. Coding theoretic 
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techniques mainly rely on the weight enumerator identities and deeper re- 
sults in the algebra of codes that use invariant theory. Though it is a fact 
that ‘most of the strongly regular graphs’ have trivial automorphism groups, 
looking for symmetries in symmetric and quasi-symmetric designs as well 
as strongly regular graphs, is one of the strong motivations for studying 
these combinatorial objects. At a meta level, it is to be expected that this 
study will also reflect on the structures of many interesting finite groups 
(and vice versa), the study of which has been carried out extensively in the 
last hundred years. 
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The concept of a determinant in an infinite-dimensional space is intro- 
duced, leading to the perturbation determinant for a pair of self adjoint 
operators (H,Ho) as a meromorphic function in complex plane. The 
winding number of this function counts the difference of discrete spectra 
of H and Ho. As an application the generalized Levinson theorem con- 
necting the number of bound states of H and the Krein’s shift function 
at 0+ is proved. 


9.1. Introduction 


One can extend the notion of a determinant to that of an operator of the 
form J+ T where T is a finite rank or trace-class operator on an infinite di- 
mensional Banach or Hilbert space respectively. This determinant satisfies 
many of the properties of the usual finite-dimensional one. This leads to a 
natural definition of a meromorphic function, called perturbation determi- 
nant, defined in an open subset of the complex plane for a pair (T, T+ A) 
where T is a closed operator and A is either finite-rank on a Banach space 
or trace class in a Hilbert space. This constitutes essentially the content of 
the next section which ends with a theorem, known as Weinstein-Aronsajn 
formula that connects the total number of eigenvalues of the pair (7, 7+ A) 
(counting multiplicities) with the poles and zeros of the perturbation de- 
terminant of the pair. 

The third section introduces the Witten index in terms of the trace of the 
difference of the resolvents of two self-adjoint operators on a Hilbert space 
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and proves a theorem about the invariance of such an index under “small” 
perturbations. The fourth section deals with Krein’s theorems where the 
trace of the difference of the resolvents mentioned above is given as an 
integral representation in terms of a real valued function defined on the 
union of the spectra of the two self adjoint operators. This function is called 
Krein’s shift function and it is shown that under suitable conditions, the 
Witten index equals (up to a sign) the value of the Krein’s shift function at 
0+. In the final section all these results are put together to show that in 
many situations of interest in the scattering theory in quantum mechanics, 
the shift function at 0+ is an integer (the total number of bound states in 
physicist’s language) so that the Witten index is a negative integer which 
is invariant under small perturbations. 


9.2. Perturbation Determinant 


Let X be a Banach space and let T € B(X), the Banach space of all linear 
bounded operators on X with Domain of T = X. Assume furthermore, that 
T is degenerate, i.e. range of T = Ran(T) is finite dimensional. In such a 
case, T can be represented as: 


Toa) <6; U> Yj (9.1) 
j=l 
where m = rank(T) = dim Ran(T); {y;}7L, is a basis for Ran(T’) and 


{e;}', are linearly independent elements of X*. Clearly Ran(T) is invari- 
ant under the action of T and if we denote by T” the restriction of T to the 
Ran(T), then we can define the determinant of J + T by 


det(I + T) = det(1’ + T’) (9.2) 
which implies that 
det(I + T) = det[d;, + (e;, yx). (9.3) 


In the identity (9.3), yx € Ran(T) is looked upon as a (finite - dimensional) 
subset of X and the right hand side is the usual determinant of matrices. 
With these definitions, it is easy to derive the property of such determi- 
nants: det{(I+S)(I+T)} = det(I+S) det(I+T), where T and S are two 
degenerate operators in B(X). If one defines furthermore the trace of T as 


trT =trT" =U, < 65, yy >, (9.4) 
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then one has the trace property 
trTA=trAT, VA€ B(X), (9.5) 


the proof of which reduces to the same for a pair of finite dimensional 
matrices. Furthermore the m x m dimensional matrix ((6;~% + < ej, Yr >)) 
does not have any zero eigenvalue. For its having zero eigenvalue is equiv- 
alent to the null space of (1' + T’) on Ran(T) being non-trivial. Let 
(1+ T)u = 0 for u € Ran(T). Then by (9.1), U7 < e;, wu > yy = —u 
and if we set u = yz, we get yr= —LM, < ej, Ye > yj to conclude that all 
the ys are not linearly independent contradicting the assumption of the 
m-dimensionality of Ran(T’). Thus we can define the operator In(J + T) 
to be 


In(((Oj;n-+ < ej, Ye >))) =m’ +T") € B(X). 


It is also clear from the definition that In(J+T) is also a degenerate operator 
with RanIn(J + T) = Ran(T). Therefore by the Jordan decomposition 
of T 


det(I + T) = det(I’ + T’) = Tji4(1 + A3)™ 


= exp(h'. 


4—1M; In(1 + Aj)) = exp(tr In(I’ + T’)) = exp(tr n(J + T)), 


(9.6) 


where (A;, m,;) are the eigenvalues and the multiplicities respectively for 
T’. Some more details on these can be found in Kato’s book ({8, 4]). The 
above observations, of course remain valid when X is a Hilbert space 7H. 
Moreover, since the infinite product [[(1 + A,;)'* converges to a non zero 
complex number if and only if U°°.,|A;| < oo, it is easy to see that det([+T) 
will be definable for all trace-class T’ € B, (7) i.e. T such that the singular 
values of T are summable. In such a case, det(J + T) = exp(tr n(J + T)). 

Reverting back to the Banach space scenario, let 7’ be a closed operator 
on the Banach space X and let A be relatively degenerate, i.e. for operator 
A(T — z)~! € B(X) and degenerate for some z (and hence for all z) in 
p(T), the resolvent set of T. In such a case we define for z € p(T), the 
perturbation determinant. 


A(T, T + A;z) = A(z) = det(I + A(T — z)~*). (9.7) 


It is also clear that in a Hilbert space H for A(T — Z)~+ € B,(H) for some 
z € p(T) the definition (9.7) will also make sense and 


A(z) = det(I + A(T — z)~*) = exp(tr In(I + A(T — z)~*)). (9.8) 
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In either case, it is also easy to see that for z € p(T) p(T + A), 
A(z) = det((T+A—z)(T—z)~') = [det((T—z)(T+A-—z)")]-}. (9.9) 


For proceeding further, we need the following two definitions. Let y be 
complex-valued meromorphic function defined in a domain 2 C C and we 
define the multiplicity function v(z, y) of y by 


Mm, if z is a zero of y of order m, 
viz, py) =< —m,_ ifzisa pole of y of order m, (9.10) 
0 for all other z € 2. 


Therefore v(z, y) takes values in Z except when y = 0 identically, in which 
case we set v(z, Y) = +00. Also we define the multiplicity function p(z, T) 
for a closed operator T on X by 


0, if z € p(T) , 
p(z, t) =< dim P_ ifz is an isolated point in o(T), (9.11) 
oe) otherwise, 


where P is the projection associated with the isolated point z € o(T’), given 
by the holomorphic functional calculus (see [4], p. 178). Then we have the 
theorem: 


Theorem 9.1. (Weinstein-Aronsajn formula): Let T be a closed operator 
on a Banach space X, and let A be a relatively degenerate operator on X. 
Furthermore, let A(z) be the perturbation determinant A(T, T+ A;z), and 
let Q be a domain consisting of p(T) along with the isolated eigenvalues of 
T with finite multiplicities. Then A is meromorphic in Q and we have 


p(z, T+A)=p(z, T)+v(z, A), Vz EN. (9.12) 


Since by definition, u(z, T’) is finite for all z € Q we have the alter- 
natives: either u(z, T'+ A) is finite for all z € Q or it is equal to +00 
identically. In the case of the first alternative, Q C p(T + A) except for at 
most countably many discrete eigenvalues of 7’ + A, while in the second 
Q Co(T+ A). 


Proof. In case A(z) = 0 identically, the relation (9.9) implies that 
(T + A—z)(T — z)~" has a zero eigenvalue for all z € Q which leads 
to the conclusion that z is an eigenvalue for T+ A for all z € p(T) NQ. But 
this means by definition that u(z, T+A) = +00 since either p(T)NQ = 0 in 
which case z is not an eigenvalues of T+ A and 2 C o(T) or p(T)NQ FD 
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when z is an eigenvalue, not isolated. Thus we can assume without loss 
of generality that A is not identically zero, we consider first z € p(T) 
such that A(z) # 0 and set B(z) = A(T — z)7! so that by (9.8), 
A(z) = det(J + B(z)) = expltr In(J + B(z))]. That A(z) 4 0 implies that 
B(z) has no eigenvalues —1, and in such a case by holomorphic functional 
calculus ([4], p. 178) 


In(I + B(2)) = = [ In(1 +6)(5 — B(z)) “tds, 


where C is a closed curve enclosing all the eigenvalues of B(z) (they are 
finite in number since B(z) is finite-rank) and excluding the point {—1}. 
We note that 


d d 1 d 
<n A(2) = Sirin( + B(2)) = Saget” | int + O(C — B(2)) Mae. 
(9.13) 
Interchanging the order of integration with taking trace and differentiation, 
which is permissible because the function z — (¢ — B(z))~1 is differentiable 
in trace, we note that: 


= (¢ — B(z)) "(A(T — 2)-7)(6 — Ble) 


and substituting this in (9.13) and using the trace property, we have 
5 mA(Z) = 5 f trlin(a + 6)(6 - BG) 2B" (w)lac 
dz 7 271 C 


—- fe T{=In(1 + 6)er((6 - B(z))~1B"(z))}d¢ 


Toa ao + ¢)~*tr((¢ — B(z))~*B"(z))d¢ 
=tr{[(1+ Bea ee (9.14) 


where we have observed that the first integral in the second step vanishes 
since [In(1+¢)tr {(¢— B(z))~!B’(z)}] is an holomorphic function of ¢ inside 
C’.. On the other hand, 


tr[(1 + B(z))*B"(z)] = tr[(T — 2) (1+ AT — 2) (P - 2)" 
=n +A—2) AT —2) =-tr[(T +A-—2) + -—- 7-2)". 
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Thus for z € p(T’) such that A(z) 4 0, 
MTA (Ay = £m A(2). (9.15) 


In the above we have also noted that since (T+ A-—z) = (J+ B(z))(T—z) and 
since for such z, + B(z) is bounded invertible it follows that z € p(T’+ A). 
Finally, to obtain Weinstein-Aronsajn (W-A) formula, we need to integrate 
the relation (9.15) and realize that 4 f., 4 m A(z) = v(A, A) for a small 
circle C enclosing \ € 2, and also that +4 J, (T — z)~'dz = P, the eigen- 
projection for the eigenvalue of T and p(A, T) = dimP = trP. O 


Corollary 9.1. Let H be a separable Hilbert space, T a closed operator on 
H and A € By(H). Then also W-A formula (9.12) is true. 


The proof follows by observing that the passage from equation (9.13) to 
(9.14) remains valid by an application of dominated convergence theorem 
in trace-norm, to interchange the integral and trace ;and then the rest of 
the steps also remain valid. 


Remark 9.1. Let 7 be a separable Hilbert space, 7, and T> two closed 
operators on H such that [(T2 — z)~1 — (T, — z)~'] € Bi(H) for some z € 
p(T) Np(T2). Then (Tz —z)~1—(T,—z)—! € By(H) for all z € p(T1)Np(T2). 
This easily follows from the resolvent equation. 


9.3. Witten Index and Its Invariance 


For the rest of this article 1 is a separable Hilbert space, and let H and Ho 
be two self joint operators on H. We also assume that [(H — z)~! — (Ho — 
z)~1] € B,(H), the Banach space of trace-class operators on H, for some z 
(and hence every z) in p(H) p(Ho). Then the Witten index 6(H, Ho) for 
the pair (H, Ho) is defined as: 


6(H, Ho) = [—ztr{(H —z)+-—(Ho—z)~"}], (9.16) 


lim 
z—0:|Re z|<C|Im 2| 
whenever this limit exists. This is called the non-tangential limit and using 
the resolvent equation, one can easily establish that the limit, if it exists 
for some non-tangential ray in the complex plane, then it exists for every 
other such ray and they are equal. Thus without loss of generality we shall 
assume that z is pure imaginary for the purpose of the limit. 

For the rest of this section, H = L?(R*), Hp = —A, the self-adjoint 
operator on H associated with the Laplacian, and V; : 7 = 1,2 the operators 
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of multiplication by real measurable functions V; € L? N L1(R). It is well 
known ((6, 7]) that H = Hp + V; and Hy = H+ aV, for all a € R are self- 
adjoint operators with D(H.) = D(H) = D(Ho), where we have written 
D(A) for the domain of definition of a linear (not necessarily bounded) 
operator A on H. Next we note that V;(a) = sgn[V;(x)]|V; (a) |2. |V;(x)|? = 
V;(2)2|V;(2)|? so that V;?, |Vjl3 € L2(R9). 

Then we have the following sets of results which are either available 
in [6, 7] or easily provable. 


Theorem 9.2. Let H,Ho,V; be as above. Then 
(i) V7? (Ho — z)\Y%,|2 = M;x(z) are Hilbert -Schmidt and ||Mjx\l2 < Gyr, 
independent of z for all z € p(Ho). 


(ii) For all z with Im z # 0, the bounded operator I+V? (Hy—Z)" | V; |? is 
bounded invertible. If furthermore, the point 0 is not an exceptional 
point for the self-adjoint operator Hj + V; (see page 433 of [7]), then 
IZ + V;? (Ho — z)|V;|2]-1|| < Cy , independent of z with Re z in a 
neighborhood of {0}. 


(iti) For any z with Im z # 0,[(H — z)~! — (Ho — z)74], [((Ha — z)71 - 
(H — z)~*] and [(Ha — z)~! — (Ho + aV2 — z)~) are all trace - class. 


(iv) Under the condition of part (ii), the Witten index 6(H,, H) = 0 for 
sufficiently small |a| and therefore, 6(Ho + aV2, Ho) = 0 for sufficiently 
small |a| by setting V; = 0. 


(v) The Witten index is invariant under “small perturbation”, i.e. for suf- 
ficiently small |\a| 


6(Hp + Vi + aVa, Hy + aV2) = 6(H) + Vi, Hb), (9.17) 


in the sense that if one exists so does the other and they are equal. 


Proof. Sketch: 
(i) By the hypothesis on the functions V;, it is clear that V; € L?(R?). Also 
the operator M;,(z) is an integral operator with kernel 


1 eV2|x—y| 1 
Myx. (2)(x, y) = Vj (x)? Ine ay EW? 


) 
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where the branch of the square root with Im,/z > 0 is taken. Thus by 
Hardy-Littlewood-Sobolev inequality (see [6]) 


ft o-2llmy2||a—y 
| Mjx(z)ll2 = T6n2 / IV5(@)| a Ve) ide dy 
<C"||Villal|Vells = Bix, 


independent of z. 


(ii) We have seen that M;;(z) € Bo(H), therefore compact. Also from 
the kernel of M;;(Z) and the choice of the branch of \/z, it is clear that 
p(Ho) > z+ Mj;(z) is Bo analytic, since V; € L*. Furthermore, from the 
resolvent equation (H + V; — z)~!— (Hp —z)7! = —(Ho — 2) 71V;(H —z)7! 
for z € p(H)N p(Ho), it follows that, for Im z 4 0, 


(1+. V;? (Ho — Z)"|Vj|?)"! = (1- VP (A+ Vj — 2) 1/2) € BIH). 
Therefore by the analytic Fredholm theorem ({6]), 
[1 + M,;(A + i0)|-* € B(H) 


for all \ € R except for a closed set of Lebesgue measure 0. If for AX = Ao, the 
above operator is not invertible then it follows that g,, = (Ho—Ao)~'|V| 2 f 
with f in the null space N[1+ M;;(Ao +7 0)] is the eigenvector of the oper- 
ator Ho + V;. Since by hypothesis Ap = 0 is not an eigenvector of Ho + Vj, 
the part (ii) is proven. 


(iii) Since if V; € L?(R3), then V;(Hp — z)~! € Bg and 


Vj (Ho—2) 412 = — } / yee aay eowe— 
— 2 167? : |a — y|? a IN2 Im fz 


for Im z #0. This along with the resolvent equations verify (iii). 


(iv) Since D(H) = D(Ho) = D(Hp + V2), for z with Im z £0 
(H — zy? — (Ho - ay = —(Hp — z) Vi (HH = ie 
and by (i) and (ii) we have 
Mye(z) = Vii (H — z)71|Vel? = Mjx(2) — My (2)Mix(2). 
Thus for Im z 4 0, 
Mu(z) = (1+ Mu(z))' Mi (2) 
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and similarly ((V2(H — z)7!) = [1 + Mu(z)|71(V2 (Ho — z)7!) € Bo(H). 
One can show exactly in the same way that 


V2(H + aV 9 —z)7! = [1 — aMgo(z)]71[1 + Mir(z)|71[V2(H — z)7!] and 
Mio(z) = [1 + Mui(z)]7!]Mio(z). (9.18) 
By (ii) and equation (9.18), we get that, SUP 2€\0,i] | My2(z) lla< C’” Bie 
and then by choosing |a| sufficiently small, 
\|V 2 (H+aV3—2)||2 < (C”)?(1-|a|G12)7"||V 2 (Ho —2)||2 = D (Imyz)~?. 
(9.19) 
Finally putting all these together, we have that for z = iy, 
|tr[(H + aV2 — iy)~* — (H —iy)~]| 
< |a|||(H — ty)" Vo(H + aVe — iy)* 1 
< |al||| Voll? (H+ ty)" ||2|| Vo? (H + aVe — iy)" |I2 < D! yo? 
and hence 


lim  —z tr[(H +aV2—2z)7' -(H-2z)""| 


y—0,z=1y 
= lim —iy tr|(H + aV2 —iy)~* — (H —w)] =0 
y= 


for |a| sufficiently small. Setting Vi = 0, for small enough |a| we get 


lim | [(Ho + V2 —4 y)™ — (Ho — ty)~"] = 0. 
yorr—ry 


(v) Finally, since 
tr[(Ho + Vi + aV2 — iy)? — (Hp + aVe — iy)~"] 
—tr[(H — iy)~ — (Ho — iy)7)] 
= tr[(Ho + Vi + aV2 — iy)? — (Ho + Vi — ty) 
—tr[(Ho + aV2 — iy)! — (Ho — iy)~*] 


it follows by result of (iv) that under the assumption in (ii), for sufficiently 
small ja|, 6(H +aVo, Ho +aV2) = 6(H, Ho). O 


In the special situation where H, and Ho are positive self adjoint op- 
erators A*A and AA* respectively, with A a Fredholm operator on some 
Hilbert space, then the Witten index for the pair 6(Hi, H2) is equal to the 
Fredholm index of A, which is an integer. But in a general case, the Witten 
index need not be an integer. 
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9.4. Krein’s Shift Function 


Given two self adjoint operators on a Hilbert space such that their difference 
is a trace-class operator, there is an L1-function € (called Krein’s Shift 
function ) such that the integral of € equals the trace of the above difference. 
The value of € at zero, if it exists, has many properties of an index. 


Proposition 9.1. Let Ho be a self-adjoint operator on a Hilbert space 
H(dimH < co) and let V be another self adjoint operator. There exists 
a unique real-valued function € € L°(R), supported in o(Ho) Uo(Ho + V) 
such that for every polynomial p on R. 


tr{p(Ho + V) ~ p(Ho)] = f p'Q)E()ad. 
In fact, by the spectral theorem and functional calculus, one has that 


p(Ho + V) — p(Ho) = / p(A) [E(dA) — Eo(dX)), 


where Ey and E are the spectral family of the self-adjoint operators Ho 
and Hj) + V respectively. Clearly these integrals are actually sums and the 
integral is over the union of the two spectra contained in some interval [{a, }}. 
One can do integration by parts to get 


p(Ho + V) — p(Ho) = - p'()[Eo(A) — B(A)]d 


and the expression for the trace follows with €(A) = tr[Eo(A) — E(A)]. 
In infinite dimensional #1, the story changes drastically, even for a pair of 
bounded self adjoint operators they can have only continuous spectra and 
moreover,they can be pair of unbounded self-adjoint operators. We state 
the next two theorems without proof, which can be found in [1, 3, 5}. 


Theorem 9.3. Let H and Ho be two self adjoint operators on a separable 
Hilbert space H such that V = H — Hp is trace-class (i.e € By(H)). Then 
there exists unique L'-function € with support in o(H)Uo(Ho) such that 


(i) trV = f &(A)d), 
(it) €A)|aX< [Vhs - 
(itt) for p : R > C such that (A) = f[ —<=*dv(t)+ constant, with v a 
complex measure in R, 


o(H) — o(Ho) € Bi(H) and tr[y(H) — y(Ho] = / ol (AE (A)Ad, 


Index Theorem 167 


(iv) €(A) = 4 limejo IMIn A(A + ie) for almost all d. 

Theorem 9.4. Let H and Ho be two self adjoint operators such that 

(H — z)~' — (Ho — z)~' € Bi(H) for some z € p(H)O p(Ho). Then there 
exists a real-valued measurable function € on R, unique up to an additive 
constant such that 


. 
(i) po # GO; 
(ii) tr ((H - 2“) - (Ho - 2) = - f f&aae 
(iti) For every function yp :R — C with the property w(A) = (1+A?)~*y() 
where yp satisfies the condition of (iii) of the previous theorem, 


y(H) — y(Ho) € Bi(H) and tr[h(H) — (Ho) = f BI(ME(AAD 


The next theorem (cf. [2, 8]) connects the Witten index 6(H, Ho) and 
the Krein’s shift function €(H, Ho; A) for the pair (H, Ho). 


Theorem 9.5. Let H and Ho be self-adjoint operators bounded below on a 
Hilbert space H such that [(H — z)~1— (Ho — z)~*] € Bi(H) for z € p(H)n 
p(H2). Assume furthermore that € is locally bounded and piecewise contin- 
uous. Then 6(H, Ho) exists and is equal to €(H, pal — €(H, on 


Proof. Let f(z) = —ztr[(H - z)"! - (Ho - 2)4] = -2 f2; a > aed 
where 6 = —min(info(H), info(Hpo)) and we cs written SA 3 ri 
€(H, Ho; A). Thus by an integration by parts, we have that 


. E(A) _ z O 2 
ef pO = es) +60) + eno 


In the above we have used the fact that under the hypothesis of the theorem, 
€ is locally a signed measure. The non-tangential condition | Re z| < C|Im z| 
leads to the bound |x=;| < VC? + 1 and therefore, by the dominated con- 
vergence theorem, 


0 
: gO) 
eapeceins 2| -2f (A _ nee — €(0_). 


An identical argument proves that 


_ 
20:|Rez|<O|Im z| Jo A? + las a ey). 


On the other hand for |z| < a 


© g(x) fT EO) 2 della? +1) £* (EQ 
ef C= ae FL G ayer = Gp [oor 
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and it goes to 0 as z tends to 0. Thus 6(H, Ho) exists in this case and 
6(H, Ho) = €(0_) — €(0+). 0 


9.5. Application to Quantum Mechanics and Generalized 
Levinson’s Theorem 


Next, we go back to the quantum mechanical case addressed in 
Theorem 9.2: H = L?(R*), Ho = —A, the potentials V; € L? M L'(R?) 
for 7 = 1,2. In such a case it is known ((6, 7|) that Ho, H = Hp +Vi, Ha = 
H + aV2 are all bounded below and that (real) negative spectra are re- 
spectively o_(Ho) = @ and o_(H),o_(Hq) consists of finite number of 
discrete eigenvalues with finite multiplicities. Therefore the open interval 
(—K, 0) € p(H) and p(H,) for & > 0 small enough and therefore by the 
definition of € (see Theorem 4.2), €(A) = 0 for A € (—«, 0) and thus 
€(H, Ho;0_) = (Ha, Ho;0_) = 0. In such a case the conclusion of the 
Theorem 4.4 would be that the Witten index 6(H, Ho) exist and is equal to 
—€(H, Hpo;0+), provided the other assumption on the shift function €, viz. 
that € is locally bounded and piecewise continuous, is valid. 

If these extra hypothesis on €(H,Ho;A) and on €(H,, Ho + aV2;X) are 
satisfied, then combining all the above results we conclude that the Witten 
indices 6(H, Ho) and 6(H., Ho + aV2) exist and are equal, and therefore, 


6(Ha, Ho +0V2) = —€( Ha, Ho +aV2;0+) = 0(0, Bo) =—€(H, Ho; 0+). 
(9.20) 


The above relation (9.20), which on the one hand describes the invariance 
of the Witten index under small perturbations, can be understood, on the 
other hand as a consequence of what may be called Levinson’s theorem 
which we go on to describe now. Originally Levinson proved a similar the- 
orem in H=L?(R) i.e. where Ho and H are ordinary differential operators. 
The starting point is W-A formula or rather the relation (9.15) with a pair 
of self adjoint operators Hp and H = H)+V ona Hilbert space H replacing 
T and T + A such that [(H — z)~! — (Ho — z)~'] is trace class for some 
(and hence for all) z € p(H) NM p(Ho); this along with Krein’s Theorem 9.4 
leads to 

tr[(H —z) 1 -(Hp-z) t= _— In(A) = — / aw. (9.21) 
Theorem 9.6. Let H and Ho be two self adjoint operators on a Hilbert 
space such that (H — z)~! — (Ho — 2)7+ € Bi(H) for some z € 
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p(H)()p(Ho). Assume furthermore the following: 


(i) H and Ho bounded below, i.e. H,H) > —G, the discrete spectrum of 
H contained in |—b, —a] withO <a<b< 8 < o and Ho has no point 
spectrum, 


(ii) the associated Krein’s shift function €(H, Ho; A) = €(A) locally bounded 
and piecewise continuous, 
(itt) E(+00) = limy.4.00 €(A) exists and is equal to 0. 


Then €(H, Ho;0+) =n, where n = Xi), cca:.0(H) (Aa, A). 


Proof. The relation (9.15) is integrated along the contour given below. 


We note the following, the function z +> tr[(H — z)~! — Hp — z)~+] is 
continuous on the contour C,,r = Cr U (Ci; UC2) UC,, and is meromo- 
prphic in the interior with poles at {A;}, the discrete spectrum of H with 
(Ai, H) = dim P; < co and 


— tr[(H — z)~! — (Ho — z)"|dz = DjdimP, = Djp(A;, H) =n. 
272 Cow 

On the other hand, setting y(z) = tr[(H — z)~' — (Ho — z)~"], we note 
that f., y(z)dz = — J, + mA(z)dz and since 4 n A(z) is holomorphic 
in compact region containing C’., it is uniformly bounded in that region, 
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say by c’ so that Ifo. y(z)dz| < arc’ +0asr— 0. For the integral on 
Cr, we use (9.21), to get that 


r)| / |E(A)| 
x < : 
fo eae [. lal | Grp Roo 02 aan Se: Waraoreaa =| pam 


Next we note that the support of the A-integral is contained in |[—b, —a] U 
[0, co) by (i) and we write 


(A ~ |EO "NEO 
Rf em = Ff, ee | eee 


+Rf se vee 


where given € > 0 we have chosen N so large positive that |€(A)| < € for all 
A\ > N by (iii). Since by (ii), € is bounded locally, the first two integrals can 
be estimated by C; tan7! am where C; and C2 are two positive constants. 


Therefore the first two integrals converges to zero as R — oo while the 
third integral can be bounded by 45£. This proves that | Sor ~p(z)dz| — 0 
as R — oo. That leaves the integrals on the lines C, and C2. But 


[. p(z)dz + i. p(z)dz 


= [In A(ir) — In A(—ir)] — [mn AGR) — n A(-7R)| 

= 2i{Im nA(i r) —Imn ACiR)}. 
From the definition of the perturbation determinant A(z), it is clear that 
In A(iR) — 0 as R — o and from the property of € as in Theorem 9.4 and 


assumption (ii) of this theorem it follows that Im In A(ir) — w€(H, Ho; 0+) 
as r — 0+. Finally putting all these together one gets that: 


= 1 ; =) —1 
n= 5 lim, oe — (H -—0-2Z)~‘|dz 


21 
-- Pra CLe Ho;0+) = €(H, Ho; 0+), 
ri 


which is the generalized Levinson’s theorem. CL) 


Combining this with the Theorem 9.2 and the discussions at the begin- 
ning of this section, we get the following corollary relevant to the quantum 
mechanical situation. 


Corollary 9.2. Let Hp) = —A, H = Ho + Vi and Hy = H+ av, (with 
a real) be the three self adjoint operators on the Hilbert space H = L?(R°*) 
with V; (j=1,2) real valued functions in L? Q L'(R*) as in Theorem 8.1. 
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Assume furthermore as in the discussion in the first paragraph of this 
section that all the relevant Krein’s shift function satisfy the properties of 
(it) and (iti) of Theorem 5.1, viz. that they are locally bounded and piecewise 
continuous and they converge to zero at infinity. Then 


O( Aes Hp + aV2) = 6(H, Ho) = —€(H, Ho; 0+) =-n 
for sufficiently small |a| and where n = Uju(A;, H) as in Theorem 9.6. 


The proof follows by combining the earlier discussions and Theorem 9.6. 
In the language of physics, the corollary states that the number of bound 
states (i.e. the discrete spectrum, counting multiplicities) of the Hamilto- 
nian operator H, = Hp +aVi+aV% for |a| sufficiently small is independent 
of a. 
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Let us begin with some elementary observations. Let A be a Dedekind 
domain (see [8], Ch. V). Two basic examples are obtained as follows. 


(i) Let K be a finitely generated extension field of transcendence degree 1 
of an algebraically closed field k, t € K a transcendence basis, and 


A = integral closure of k[t] in K. 


These rings arise as the coordinate rings of nonsingular affine algebraic 
curves over k. In this lecture, we focus on this example and some natural 
generalizations. 

(ii) Let K be an algebraic number field (finite algebraic extension of Q), 
and let 


A = integral closure of Z in K. 
This is called the ring of algebraic integers in K. 
For a Dedekind domain A, let 
Div (A) = Free abelian group on maximal ideals of A 
= Group of divisors of A. 


If a € A is a non-zero element, it defines a principal ideal aA, which 
has a factorization of the form aA = Sit ---Mt?, where M; are distinct 
maximal ideals, and n; are positive integers (if a is a unit, then aA = A, 
and we take r = 0). Equivalently, aA has a primary decomposition aA = 
MA ++. A Mr (the equivalence is a property of Dedekind domains). 
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The assignment 


i=1 
gives a semigroup homomorphism 
A \ {0} > Div (A). 


The subgroup P (A) Cc Div (A) generated by the image of A \ {0} is called 
the group of principal divisors. 
We may now define 


Cl(A) = Divisor class group of A 


_ Div (A) 
~ P(A) ~ 


Theorem 10.1. For a Dedekind domain A, the following properties are 
equivalent. 

(i) A has the unique factorization property. 

(ii) Every maximal ideal of A is a principal ideal. 


(iti) C1(A) = 0. 


Thus the divisor class group measures the failure of the unique factorization 
property. 

The divisor class group of a Dedekind domain has an interpretation in 
terms of algebraic K-theory (see [5]). 

Recall that a projective A-module is a direct summand of a free A- 
module. Let Ko(A) be the Grothendieck group of finitely generated projec- 
tive A-modules, defined as a quotient 


F(A) 
Ko(A) = —— 
with 
F(A) = Free abelian group on isomorphism classes of 
- finitely generated projective A-modules 
R(A) = subgroup of F(A) generated by classes [P2] — [P;] — [P3] 


~ for all exact sequences 0 — P, — Pp — P3 — 0. 


Since a Dedekind domain A has Krull dimenion 1 (i.e., every non-zero 
prime ideal is a maximal ideal), one can show that there is an isomorphism 


Ko(A) = Z@Cl(A). 
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The map Ko(A) — Z is induced by 
[P] + rank P = dimx P @, K, 


where K is the quotient field of A. 
To define the map Ko(A) — Cl(A), one first identifies Cl(A) with the 
Picard group 


Pic (A) = isomorphism classes of projective A-modules of rank 1, 


which is a group with respect to the tensor product, with the class of the 
free module A as identity element. 

If St is any maximal ideal of A, then it is in fact a projective A-module 
of rank 1, and so has a class in Pic(A). This defines a homomorphism 
Div (A) — Pic (A), which vanishes on P (A) because a principal ideal aA is 
a free module, which represents the identity element of Pic (A); thus, there 
is an induced homomorphism Cl(A) — Pic(A). For certain “traditional” 
reasons, one usually multiplies the above map Cl(A) — Pic (A) by -1; it 
can be shown that it is an isomorphism. 

Now if P is a projective module, say of rank s, we can associate to it 
the element 


[det(P)] = [AP] € Pic (A), 


the determinant of the projective module P, and (using the above iden- 
tification), there is a corresponding element in Cl(A). Thus we have a 
composition 


Ko(A)““4Pic (A)—>Cl (A). 


To make a connection with geometry, let A be a Dedekind domain which 
is the coordinate ring of an affine algebraic curve C, over an algebraically 
closed field k ((see [2]). The points of C are in bijection with the maximal 
ideals of A. We write C = SpecA to denote this relationship. Finitely 
generated projective A-modules correspond to vector bundles on the curve 
C, giving an isomorphism of Ko(A) with the Grothendieck group Ko(C) of 
vector bundles. 

There is an associated projective non-singular curve X over k, such 
that C = X \ S for some finite, nonempty set S of points of X. If K is the 
quotient field of A, then the points of X are in bijection with the discrete 
valuation subrings of K which contain k (see [2], Ch. I). This determines X 
as a set, and the projective algebraic structure of X may also be determined 
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intrinsically, using the algebraic structure of the field K, which is then 
identified with the field of rational functions on the curve X. 

There is a similarly defined Grothendieck group K(X ) of vector bundles 
(or locally free sheaves) on X. If Pic (X) is the group of isomorphism classes 
of algebraic line bundles on X, one can show that there is an isomorphism 


Ko(X) ¥ Z@ Pic (X), 


again given by the rank and determinant for vector bundles. 
There is also an isomorphism Pic (X) = Cl (X) (which we may view as 
the algebraic first Chern class), where Cl(X) = Div (X)/P (X), with 


Div (X) = Group of Divisors on X 
= Free abelian group on points of X, 


P(X) = Principal divisors 
= Divisors of non-zero rational functions on X. 


If f is a nonzero rational function on X, its divisor is 
div (f) = (zeroes of f) — (poles of f) 


where the zeroes and poles are counted with multiplicities. 
Finally, there is a relationship between the class groups of X and of 
C = Spec A = X \ S, given by an isomorphism 
Cl1(A) = Cl(C) 
~ Cl(X) 
~ Subgroup generated by points of S: 
The geometry comes in now, in terms of a structure theorem for Cl (X). 
There is a degree homomorphism 


deg : C1(X) — Z. 


This is well-defined, because any nonzero rational function on X is known to 
have the same number of poles and zeroes, when counted with multiplicities. 

Now an important theorem in algebraic geometry is that the kernel 
Cl(X)dego has the structure of an abelian variety (a projective algebraic 
group variety), called the Jacobian variety, which is denoted J(X), and 
whose dimension is the genus of the non-singular projective curve X. 

If the ground field k is C, the field of complex numbers, then the complex 
points of X naturally form a compact Riemann surface, and the “algebraic 


Zero Cycles 177 


genus” of X, as considered above, equals the “topological genus” of the 
underlying compact connected oriented 2-manifold. The Jacobian J(X) is 
identified, as a real Lie group, with the torus H'(X,R/Z), which has a 
natural complex structure coming from the theory of harmonic forms. 

The genus may be defined algebraically as the dimension of the vector 
space of algebraic regular 1-forms on the curve X, or equivalently (using 
Serre duality) as the dimension of the sheaf cohomology H'(X, Ox) of the 
sheaf Ox of algebraic regular functions on X. 

We now have the following remarkable result: 


Theorem 10.2. Let C = Spec A be an affine algebraic curve over an alge- 
braically closed field k, where A is a Dedekind domain (i.e., C is a nonsin- 
gular algebraic curve). Let X be the corresponding nonsingular projective 
algebraic curve over k. Then the following are equivalent: 


every maximal ideal of A is principal = Cl1(A) =0<¢ 
the projective curve X has genus 0, i.e., H'(X,Ox) =0. 


This is because of a property of abelian varieties: if the Jacobian J(X) is 
nonzero (equivalently the genus of X is nonzero), then J(X) is not a finitely 
generated group. This means that for A as in the Theorem, we either have 
that Cl(A) = 0, or that Cl(A) is not finitely generated. 

A refinement of the above result applies to an arbitrary reduced, finitely 
generated k-algebra A of Krull dimension 1, where & is an algebraically 
closed field. 

Once again, one can uniquely associate to C = SpecA a projective alge- 
braic curve X, such that C = X \ S for a finite set S of nonsingular points 
of X. The dimension of the sheaf cohomology group H'(X,Ox) is called 
the arithmetic genus of the projective curve X, and equals the dimension 
of a certain algebraic group variety called the generalized Jacobian of the 
(possibly singular) curve X. 

A point x € C is a nonsingular point if for the corresponding maximal 
ideal IN, of A, the localization Agr, is a discrete valuation ring; C' has only 
a finite number of singular points. The maximal ideal corresponding to a 
nonsingular (or smooth) point will be called a smooth maximal ideal. 


Theorem 10.3. Let C = Spec A and X be as above. Then: 


every smooth maximal ideal of A is principal = 


the projective curve X has arithmetic genus 0. 
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We now turn to the higher dimensional case. 

If A is a finitely generated k-algebra which is an integrally closed domain 
(i.e., a normal domain) of dimension d, one can associate to it a divisor class 
group, defined using Weil divisors (free abelian group on irreducible d — 1- 
dimensional subvarieties), with a relationship to the theory of the Picard 
variety. One result obtained from this theory is the following (it can be 
refined in several ways, which we do not go into here). 


Theorem 10.4. Let A be a finitely generated k-algebra, which is an integral 
domain. Then the following properties hold. 


(i) The group of units of A is of the form 
A* = k* x (free abelian group of finite rank). 


(ii) Assume that A is normal. Let X be a normal projective algebraic k- 
variety containing V = Spec A as a dense Zariski open set. Then the 
divisor class group of A is finitely generated <= the Picard variety of X 
(in the sense of Weil) is trivial. 


Another generalization, which is our main interest here, is to consider 
the complete intersection property for maximal ideals. 

Let A be a reduced, finitely generated algebra of Krull dimension d, 
over an algebraically closed field k. Let V = Spec A be the affine variety 
associated to A, so that maximal ideals of A correspond to (closed) points 
of V. A point x € V, corresponding to a maximal ideal 9%, in A, is called a 
complete intersection point if the ideal SN, is of height d, and is generated 
by d elements f;,..., fa. Geometrically, this means that if H; C V is the 
hypersurface defined by 


A; = {y € V|fily) = 0} = Spec A/fiA, 


then H,1--- Hq = {x}, and x € V is a nonsingular point, such that the 
hypersurfaces H; are also nonsingular at x, and intersect transversally. 

Note that when d = dim A = 1, MN is a complete intersection = I is a 
principal ideal. 

Recall that a point x € V is a smooth (or nonsingular) point if the local 
ring Oy, = Am, is a regular local ring of dimension d, in the sense of 
commutative algebra; this means that St, has height d, and the localized 
maximal ideal St, Agr, is generated by d elements. 

Equivalently, there is an affine Zariski open subset W C V containing 
x such that « € W is a complete intersection point. Thus, we may view a 
smooth point « € V as a “local complete intersection point”. A maximal 
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ideal IN of A is called a smooth maximal ideal if it corresponds to a smooth 
point of V = Spec A. 

The main question we want to discuss is the following. 
Question: Which k-algebras A of dimension d > 1 have the property that 
all smooth maximal ideals are complete intersections? In other words, when 
are all local complete intersection points on V = Spec A the same as the 
complete intersection points? 

There are several conjectures and results related to this Question. We 
first state a general “positive” result. 


Theorem 10.5. Let k = F, be the algebraic closure of the finite field Fy. 
Then for any reduced finitely generated k-algebra A of dimension d > 1, 
every smooth maximal ideal is a complete intersection. 


In the case when dim A > 3, or A is smooth of dimension 2, this is a result 
essentially due to M. P. Murthy. The higher dimensional case is reduced to 
the 2-dimensional case by showing that any smooth point of V = Spec A 
lies on a smooth affine surface W C V such that the ideal of W in A is 
generated by d— 2 elements (i.e., W is a complete intersection surface in 
V). This argument depends on the fact that we are dealing here with affine 
algebraic varieties. (see [6]). 

The case of an arbitrary 2-dimensional algebra is a corollary of results 
of Amalendu Krishna and mine (see [3], [4]). 

Next, we state two conjectures, which are affine versions of famous con- 
jectures on O0-cycles. 


Conjecture 10.1. (Bloch Conjecture). Let k = C, the complex numbers. 
Let V = Spec A be a non-singular affine C-variety of dimension d > 1, and 
let X D V be a smooth proper (or projective) C-variety containing V as a 
dense open subset. Then: 

all maximal ideals of A are complete intersections 

<= X does not support any global regular (or holomorphic) differential d- 
forms 

& HX, Ox) = 0. 


Here, Ox is the sheaf of algebraic regular functions on X. The non- 
existence of d-forms is equivalent to the cohomology vanishing condition, 
by Serre duality; the open question is the equivalence of either of these 
properties with the complete intersection property for maximal ideals. 

This conjecture has been verified in several “non-trivial” examples (for 
example, if V = SpecA is a “small enough” Zariski open subset of the 
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Kummer variety of an odd (> 1) dimensional abelian variety over C, all 
smooth maximal ideals of A are complete intersections). 

One consequence of the conjecture is that, for smooth affine C-varieties, 
the property that all maximal ideals are complete intersections is a bira- 
tional invariant (that is, it depends only on the quotient field of A, as a 
C-algebra). This birational invariance can be proved to hold in dimension 
2, using a result of Roitman; in dimensions > 3, it is unknown in general. 


Conjecture 10.2. (Bloch-Beilinson Conjecture) Let k = Q be the field 
of algebraic numbers (algebraic closure of the field of rational numbers). 
Then for any finitely generated smooth k-algebra of dimension d > 1, every 
maximal ideal is a complete intersection. 


This very deep conjecture has not yet been verified in any “nontrivial” 
example (i-e., one where there do exist smooth maximal ideals of A Wg C 
which are not complete intersections). 

However, it is part of a more extensive set of interrelated conjectures (see 
[1]) relating K-groups of motives over algebraic number fields and special 
values of L-functions, and there are nontrivial examples where some other 
parts of this system of conjectures can be verified. This is viewed as indirect 
evidence for the above conjecture. 

I will now relate these conjectures to algebraic cycles and K-theory. 

The first step is a result of Murthy, giving a K-theoretic interpretation 
of the complete intersection property. 

Recall that Ko(A) denotes the Grothendieck group of finitely generated 
projective A-modules. If M is an arbitrary finitely generated A-module, 
recall that M has finite projective dimension if there exists a finite projective 
resolution of M, i.e., an exact sequence 


0— Peo Raa PRO M0 


where the P; are finitely generated projective A-modules. Then M has a 
well-defined class [M] € Ko(A), obtained by choosing any such resolution, 
and defining 


i 


(M] = S7(-1)'[P] € Ko(A). 


i=0 


Recall also that a maximal ideal 97 has finite projective dimension pre- 
cisely when the local ring Agy is a regular local ring. 
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Theorem 10.6. ([7]) Let A be a reduced finitely generated algebra over an 
algebraically closed field. A smooth maximal ideal IN of A is a complete 
intersection <> its class in Ko(A) vanishes. 


Let A be a reduced, finitely generated algebra, of Krull dimension d, over 
an algebraically closed field k. We can associate to it the group F¢Ko(A), 
the subgroup of Ko(A) generated by the classes of smooth maximal ideals. If 
V = Spec A, then F?Ko(A) is a quotient of the free abelian group on smooth 
points of V, modulo a suitable equivalence relation. Murthy’s theorem in 
fact is stronger: if V is non-singular, it identifies the above equivalence 
relation with rational equivalence, coming from algebraic geometry. 

Recall that the Chow group of 0-cycles modulo rational equivalence on 
a smooth d-dimensional variety X is 


_ 24%) 


CH%(X) R4(X)’ 


where 
Z¢(X) = Free abelian group on points of X, 
and R4(X) c Z4(X) is the subgroup generated by 
div (f)c = (zeroes of f) — (poles of f), 


for all curves C C X, and nonzero rational functions f on C. 

The Grothendieck group Ko(X ) of algebraic vector bundles on X equals 
the Grothendieck group of coherent sheaves on X, since X is smooth. Let 
F4Ko(X) be the subgroup of Ko(X) generated by the classes of points on 
X. The induced surjective map Z4(X) — F4Ko(X) is easily seen to yield 
a surjection 


Wa: CH4(X) > F4Ko(X). 


Grothendieck’s algebraic theory of Chern classes, and the Riemann- 
Roch Theorem (“without denominators”), implies that the d‘® algebraic 
Chern class gives a homomorphism 


Gat FUR G(X) + CHU), 


so that the compositions Wg o cg and cg 0 Wg both equal multplication by 
(—1)¢-"(d —1)}. 
In particular, wq and cg are both isomorphisms modulo torsion. 


182 V. Srinivas 


Now assume V = Spec A is an affine open subset of a nonsingular pro- 
jective k-variety X of dimension d. Clearly 
CH (X) 


CHYV) = ; 
Ww) subgroup generated by points of X \ V 


Roitman’s Theorem on torsion 0-cycles, extended by Milne to arbitrary 
characteristic, gives a description of the torsion in CH“@(X), using which it 
can be shown that CH4%(V) is a torsion free, divisible abelian group (i.e., 
a vector space over Q). In particular, we see that the map Wg: CH4(V) > 
F4¢K (V) is an isomorphism. 

Thus, by Murthy’s theorem, all maximal ideals of A are complete inter- 
sections = CH4(X) is generated by points of X \ V. 

We now restate the Bloch and Bloch-Beilinson Conjectures in their 
“original” forms. 


Conjecture 10.3. (Bloch Conjecture) Let X be a projective smooth variety 
over C. Suppose that, for some integer r > 0, X has no nonzero regular 
(or holomorphic) s-forms for any s > r. Then for any “sufficiently large” 
subvariety ZC X of dimension r, we have CH4(X \ Z) = 0. 


For a smooth projective complex surface X, this conjecture states that 
if X has no holomorphic 2-forms, then CH?(X \ C) = 0 for some curve C 
in X. This has been verified in a few situations, for example, for surfaces 
of Kodaira dimension < 1 (Bloch, Kas, Lieberman), for general Godeaux 
surfaces (Voisin), and in some other cases. In higher dimensions, Roitman 
proved it for complete intersections in projective space, and there are a 
few other isolated examples, like the Kummer variety associated to an odd 
dimensional abelian variety (Bloch and myself). 


Conjecture 10.4. (Bloch-Beilinson Conjecture) Let X be a smooth projec- 
tive variety of dimension d over Q. Then CH4(X) is “finite dimensional”; 
in particular, there is a curve CC X so that CH4(X \ C) =0. 


As remarked earlier, there is only indirect evidence for this conjecture: 
it has not been verified for any smooth projective surface over Q which 
supports a non-zero 2-form (e.g., any hypersurface in projective 3-space of 
degree > 4). 

To exhibit one such nontrivial example is already an interesting open 
question. 

From the algebraic viewpoint, it seems restrictive to work only with 
smooth varieties. In any case, it is unknown in characteristic p > 0 that 
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a smooth affine variety V can be realized as an open subset of a smooth 
proper variety X (in characteristic 0, this follows from Hironaka’s theorem 
on resolution of singularities). 

Inspite of this, it is possible to make a systematic study of the singular 
case, and to try to extend the above conjectures. 

For the purposes of this chapter, let me focus on one very special situ- 
ation. Let 


Zer, 
be a non-singular projective algebraic k-variety, and 


A= @n>0An 
= homogeneous coordinate ring of Z. 


The affine variety V = Spec A is the “affine cone” over Z with “vertex” 
corresponding to the unique graded maximal ideal 2 = @psoAn, and the 
vertex is the unique singular point of V. The projective cone C(Z) over 
Z with the same vertex naturally contains V as an open subset, whose 
complement is a divisor isomorphic to Z, and the vertex is again the only 
singular point of C(Z). 

The first part of the following theorem is obtained using results from my 
paper [3] with Amalendu Krishna, in the 2-dimensional case, and a preprint 
of Krishna’s in the higher dimensional case. The result over Q is in [4]. 


Theorem 10.7. (i) Let k = C. Assume that V is Cohen-Macaulay (for 
example, d = 2 and A is normal) of dimension d < 3. Then every smooth 
maximal ideal of A is a complete intersection = H4~1(Z,Oz(1)) =0 

> H“(C(Z), Oc(zy) = 

(ii) Let kk = Q. Then every smooth maximal ideal of A is a complete inter- 
section. 


Here, (i) is analogous to the Bloch Conjecture, while (ii) is analogous 
to the Bloch-Beilinson Conjecture. 
Let us close with two examples. 


Example 1 ({3]) 
__ Az, y, 2] 
(Paes) 
Here, all smooth maximal ideals of A are complete intersections, while 


“most” smooth maximal ideals of A Qo C are not complete intersections. 
The complete intersection smooth maximal ideas are those determined by 
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points on the rulings of the affine cone over points of the Fermat Quartic 
curve with Q coordinates. This is a consequence of Theorem 10.7. 


Example 2 


Oey 

(zyz(l1-x—y-z)) 
Again, all smooth maximal ideals of A are complete intersections, while 
“most” smooth maximal ideals of A @g C are not complete intersections. 
In fact, there is an identification 


F? Ko(A @g k) = Ka(k), 


where K denotes the Milnor K2. Now one has the result of Garland (vastly 


generalized by Borel) that K2(Q) = 0, while K2(C) is “very large”. 
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This article is broadly based on the talk that was given at the symposium 
‘Perspectives in the Mathematical Sciences’, held at Indian Statistical 
Institute (ISI), Bangalore, to celebrate the Platinum Jubilee of the ISI. 
I would like to thank the organisers, especially Professor N.S.N. Sastry, 
for inviting me to lecture on this occasion. The article is intended for a 
general audience and does not contain any new results. We have aimed 
at highlighting several new results that have been proved on this topic 
by various authors, in the last few years. The proofs have been omitted 
and the interested reader is referred to the research papers, which we 
have tried to list rather extensively. 


11.1. Elliptic Curves and the Birch and Swinnerton-Dyer 
Conjecture 


Let F be a finite extension of Q and let E/F be an elliptic curve. Recall 
that EF has an affine equation 


where f(x) € Fa] is a cubic polynomial with distinct roots. A famous 
result of Mordell asserts that the group E(F’) of F-rational points of E 
is a finitely generated abelian group. Let gz/p denote the rank of E(F). 
Associated with EF is the complex L-function L(E/F,s) of E, which is 
defined by an Euler product. This function converges only for Re(s) > 3/2, 
but is conjectured to have an entire continuation (see [22]), and a functional 
equation relating its values at s and 2 — s. 
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When F = Q, thanks to deep results of Wiles ([24]) and [2], this con- 
jecture is true, but it is only for elliptic curves with complex multiplication 
that it so far has been proven over all number fields F’. Assuming the an- 
alytic continuation of L(E/F,s), the analytic rank, which we denote by 
’g/F, is defined to be the order of zero of L(E/F,s) at s = 1, the centre 
of its critical strip. In the 1960’s, based on rather compelling numerical 
evidence, Birch and Swinnerton-Dyer made the astonishing conjecture that 


JE/F = TE/F- cl1.1) 


A refined version of this conjecture even gives an exact formula for the lead- 
ing Taylor coefficient of the L-function at s = 1. For more details, see [25]. 
An important part of this exact formula is the order of the Tate-Shafarevich 
group of E’/F. For a finite extension K of F’, the Tate Shafarevich group of 
E over K, denoted by I(E/K), is defined by 


IU(E/K) = Ker (nu Bp) Tr.2), (11.2) 


Here, v varies over all the places of K and K, denotes the completion of kK 
at v, while E := E(F) denotes the group of points of E over a fixed algebraic 
closure F’ of F, considered as a module over the Galois group of K. Finally, 
for any field K and a module M over the Galois group Gx := Gal(K/K), 
the first cohomology group is denoted by H!(K,M). The Tate-Shafarevich 
group is among the most mysterious groups occurring in the study of the 
arithmetic of elliptic curves and part of the full Birch and Swinnerton-Dyer 
conjecture is that it is always finite. However, it was only in the late 1980’s 
that explicit examples of elliptic curves with finite Tate-Shafarevich group 
came to light. Kolyvagin, and independently Rubin, whose work was based 
on ideas of Thaine, gave these first examples. We remark here that we do 
not yet know the finiteness of the Tate-Shafarevich group for a single elliptic 
curve of rank at least 2. 


11.2. Congruent Number Problem 


Recall that a natural number N > 1 is said to be congruent if there exists 
a right angled triangle whose sides have rational length, and area N. In 
other words N is congruent if there exists rational numbers a, b, c in Q 
such that a? + 6? = c? and ab/2 = N. One of the oldest problems in 
number theory is to explicitly give an algorithm which would determine 
whether a given number is congruent. While Arab manuscripts dating back 
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to the tenth century A.D. give a long list of examples of congruent numbers, 
it is almost certain that the ancient Indians too grappled with congruent 
numbers and knew of many examples. For more on this subject, the reader 
is referred to the book by Koblitz ([{13]) and the article by Coates ([3}). 
A folklore conjecture in this subject is the following, which remains open 
despite overwhelming numerical evidence: 


Conjecture 11.1. Jf N is a positive integer congruent to 5, 6,or 7 modulo 
&, then N is congruent. 


The connection between congruent numbers and elliptic curves is the 
following. For any integer N > 1, consider the elliptic curve EF over Q 
defined by 


By 2 4? = a —N*x. 


Then N is congruent if and only if E has a rational point (x,y), z, yE Q 
with y # 0. Indeed, if (a,b,c) are the lengths of the corresponding right 
angled triangle, with area N and a? + b? = c’, then (x,y) with 


N(a+c)  —_ 2N?(a+c) 
—_ = — 
is a point on Ey with y ~ 0. Such a point is well-known to give a point 
of infinite order on Ey and the theory of L-functions shows that for N = 
5, 6, 7 mod 8, L(En,s) has a zero of odd order (and therefore a zero) at 
s = 1. Thus, Conjecture 11.1 is seen to be a special case of the Birch and 


Swinnerton-Dyer conjecture. 

Iwasawa theory is a p-adic theory that provides a systematic method to 
attack the Birch and Swinnerton-Dyer conjecture and has led to important 
results in the study of the arithmetic of elliptic curves. The main object 
of study here is the investigation of the Galois action on the dual Selmer 
group, viewed over certain infinite extensions of F’. We refer the reader 
to [21], [9], and [3] for detailed accounts of the Iwasawa theory of elliptic 
curves. Hereafter, we fix an odd prime p. Recall that for an elliptic curve 
E/F and a finite extension K of F’, the p-Selmer group, denoted S,(E/K), 
is defined as 


Sp(E/K) = Ker (wn, Eyx) — ®H"(K,,) | 


where Epo := U Espn (F) denotes the group of all p-power division points of 
m 


E(F) considered as a module over Gx. It is well-known that S,(E/K) isa 
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cofinitely generated Z,-module and we define 
Sp,E/K ‘= Zp — corank of S,(E/K). (11.3) 


For an infinite Galois extension K, of F with Galois group G := 
Gal(K/F) a p-adic Lie group, the Selmer group S,(E/K.) is defined 
as the direct limit over the Selmer groups S,(E£/L), as L varies over finite 
Galois extensions of F’ contained in K. It is clear that S,(E/K.) is a 
discrete G-module, and its Pontryagin dual denoted X,(E/K..) is a com- 
pact G-module, and is the dual Selmer group. It is even a finitely generated, 
compact module over the Iwasawa algebra A(G) defined as the inverse limit 


A(G) := limZ,[G/G', 


the limit being taken over the group rings Z,|[G/G"] as G’ varies over open 
normal subgroups, with respect to the natural maps. The key idea in Iwa- 
sawa theory is to study the arithmetic of E over suitable infinite extensions 
via the dual Selmer groups considered as A(G)-modules. For a finite ex- 
tension K of F’, Kummer theory yields the well-known exact sequence of 
discrete G-modules 


0 > E(K) @Q,/Z, — S,(E/K) — Il(E/K)(p) > 0, (11.4) 


where II(£/k)(p) denotes the p-primary torsion subgroup of II(£/k). 
For an infinite p-adic Lie extension K of F’', with Galois group G, a direct 
limit argument gives the exact sequence of G-modules 


0 E(Kx) ® Qp/Zp mis Sp(E/Koo) a IN(E/K)(p) — 0, 


where II(E/K.)(p) denotes the p-primary part of the Tate-Shafarevich 
group of FE over K and is defined as the direct limit of II(£/K)(p) as 
K varies over finite Galois extensions of F' in K... Iwasawa theory can be 
used to prove the following result: 


Theorem 11.1. Let N be an integer > 1 such that L(En/Q,1) = 0. If 
the p-primary part I(En /Q)(p) is finite for some odd prime p, then N is 
congruent. 


The central idea here is to prove that X,(ELn /Q) has Z,-rank at least one. 
Then, using the hypothesis on HI(Fy), one deduces that the same is true 
of Ex (Q). We shall return to this briefly in the next section. 
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11.3. Root Numbers and the Parity Conjecture 


Let Gr denote the absolute Galois group as before. An Artin representation 
of Gr is a finite dimensional complex representation p of Gr which is trivial 
on an open subgroup and thus factors through a finite Galois extension of 
F. Unless necessary, the base field will not be specified and the associated 
representation space will be denoted by V,. In a similar vein, as most of 
the results we need or state below are independent of the finite extension of 
the base field through which the Artin representation factors, we shall omit 
reference to the associated finite Galois extension, except in cases where it 
might be necessary to specify the extension. 

Given an elliptic curve E/F and an Artin representation p of Gr, there 
is an associated twisted L-function defined by an Euler product and de- 
noted by L(F, p,s) (see [23] for details). Again, this twisted L-function is 
conjectured to be entire with a functional equation. More precisely, let 


~ N(E, p ar S\ 4p ay" 
L(E, p, 8) ad (“Se ) r(=) r( 9 ) L(E, p, 8); 


here d, is the dimension of V, and N(E,p) is the global conductor of the 
Galois representation associated to that of E twisted by V, (cf. [6, 2.4]). 
The conjectured functional equation is 


L(E, p,s) = w(E, p) L(E, p,2—s), (11.5) 
where / is the contragredient of p and w(F, p) is the root number, which is an 
algebraic number of complex absolute value 1. We remark that even though 
the functional equation (11.5) is largely conjectural, the root number is 
nonetheless well-defined. Indeed, by the theorem of Deligne and Langlands, 
it can be written as a product of local root numbers, taken over all places 
v of F (see [23]). In particular, if p is self-dual (i.e. p = f), then the root 
number is equal to +1 and is often referred to as the sign in the functional 
equation. 


Definition 11.1. Assuming that L(F,p,s) is entire, the p-analytic rank 
is defined as r(F,p) = ord,—1 L(F, p,s). Note that if p is self-dual and 
irreducible, then 


w(E, p) = (—1)"), (11.6) 


Now let p be an irreducible Artin representation and K/F a finite Galois 
extension through which p factors, with Gx/p = Gal(K/F). 
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Definition 11.2. The multiplicity of the irreducible representation p 
occurring in the Gx/p-module E(k) ® Q, is the p-algebraic rank and is 
denoted by gz,p. 


The invariant gz,, is in fact independent of the prime p. 


Definition 11.3. The multiplicity of the contragredient representation p 
in X(E/K) ®z, Q, is denoted by 8p,n,p. 


The refined form of Birch and Swinnerton-Dyer conjecture in this con- 
text is the conjecture that 


GE,p = TE,» aNd gE.» = Sp,F, for all p. (11.7) 


Note that when p is the trivial representation these invariants coincide with 
the invariants gz, Tz and Ss, fF defined earlier. 

The p-parity conjecture is the assertion that for self-dual, irreducible 
Artin representations p, we have 


w(E,p) = (-1)**", or equivalently that s).7,5=Tz,. mod 2. (11.8) 


We stress that it is completely unknown whether s, ¢,, has the same parity 
as JE,p, or even whether the parity of s, 2.) is independent of p. When p 
is the trivial representation, we clearly recover (11.1) as a special case of 
(11.7). Finally, note that if the root number is —1 for some self-dual Artin 
representation, then the parity conjecture implies that the dual Selmer 
group X,(E/K) has positive Z,-rank for a finite extension K of F’. Further, 
the Birch and Swinnerton-Dyer conjecture predicts that E(k) is infinite. 
However, to deduce that the elliptic curve has a point of infinite order 
over K, by (11.4), one needs to know the finiteness of the p-primary part 
of the Tate-Shafarevich group of E. We refer the reader to the beautiful 
paper of Rohrlich ({19]) which elaborates on the interplay between Artin 
representations, root numbers and elliptic curves. 

Suppose that F’/F is an elliptic curve and we are given a p-adic Lie 
extension K. of F (with p and odd prime), with Galois group G. Let p 
be an irreducible, self-dual, Artin representation which factors through a 
finite quotient of G. Then there is the notion of the twisted dual Selmer 
group, which we denote by X,(E/K.ax, p) (see [4, §3]). The basic idea is to 
study the invariants gz, , TE,» and Sp,7,, for EF as p ranges over all self- 
dual representations of G. We remark that the study of the dual Selmer 
group twisted by an Artin representation p of G, [4, §3], considered as 
a A(G)-module (see §3), is an important ingredient in the determination 
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of the invariant s,,7,,. We refer the reader to the papers of Greenberg, 
Guo [10], [11] and [4] for more details. 


11.4. Recent Results 


In this section, we list some of the comparatively recent results towards 
the p-parity conjecture. Earlier affirmative results in this direction (when 
the representation p is trivial), have been proved by many authors, notably 
[1, 11, 16, 17]. More recently, interesting work has been done by [6, 7, 10, 
12, 14, 15, 18]. 

In the next section, we shall illustrate some of these results with nu- 
merical examples and also consider some interesting applications. Here is a 
striking result due to Tim and Vladimir Dokchitser ({8]): 


Theorem 11.2. Let E be any elliptic curve over Q and p any prime num- 
ber. Then 8) 5/9 =TE/Q mod 2. 


A completely different perspective is adopted in [4], where the results 
proved give some fragmentary evidence that a close connection exists be- 
tween root numbers and the dual Selmer group of an elliptic curve over 
certain non-commutative Galois extensions of the base field F’. In fact, the 
Galois extensions being non-commutative, provide us with a rich source of 
examples of infinite families of irreducible self-dual Artin representations 
of the corresponding Galois groups. We consider two such extensions below 
where the base field is assumed to be Q for simplicity. 

Let p be an odd prime. Fix an integer m > 1 which is p-power free, and 
not divisible by any prime of additive reduction for E. 


Definition 11.4. The False Tate extension of Q corresponding to m is 
defined as 


Foo = U Quip» sm"). (11.9) 
For n > 1, put 


i, = Q(ppn,m/?"), ky = Q(Lp), Ly = Q(m1/?"), (11.10) 


and let K*°Y° denote the field obtained by adjoining all the p-power roots 
of unity to kK := Q(pp). Put 


G = Gal(F,/Q), H =Gal(Fuo/K°) © Zp. (11.11) 


Our hypotheses above on m imply that the degree [L,, : Q| = p” for all 
n > 0. The extension F’,, is a p-adic Lie extension of Q with Galois group 
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isomorphic to the semi-direct product of H and Z;. As the group H is 
isomorphic to Z,, the Iwasawa algebra A(H) is isomorphic to the power 
series ring in one variable over Zp. 


We fix an odd prime p such that E'/Q has good ordinary reduction at 
p. Further, we shall also assume that the quotient of the dual Selmer group 
by its p-primary torsion subgroup, 


Yp(E/Foo) = Xp(E/Foo)/Xp(E/Foo)(p) (11.12) 


is finitely generated as a A(H)-module. Indeed, as we shall discuss in the 
next section, there are interesting numerical examples where these assump- 
tions are satisfied, and it is conjectured in [5] that this latter hypothesis 
always holds. The self-dual irreducible Artin representations of G are well- 
known in this case. Further, the twisted L-functions are all known to be 
entire, and satisfy the standard functional equation, thanks to deep results 
in automorphic forms (Langlands-Tunnell, Arthur-Clozel, Wiles, Breuil- 
Conrad-Diamond-Taylor). Also, the root numbers exhibit a surprisingly 
uniform behaviour as was shown by T. Dokchitser. Further, it can be shown 
that the parity of the root numbers is equal to that of the A(#)-rank of 
Y,(E/F..). We also have [4, §4| 


Theorem 11.3. Let E/Q be an elliptic curve such that E has good ordinary 
reduction at p. Assume that Y,(E/F.) (see (11.12)), is a finitely generated 
A(H)-module. Then for all self-dual, irreducible Artin representations p of 
G with dimension > 1, the p-parity conjecture holds, i.e. 


w(E, p) = (-1)°*, 
The second infinite extension is obtained as follows. Let E/Q be an 


elliptic curve with potential good ordinary reduction at p, where p > 5. 
Assume also that EF does not have complex multiplication. Define 


Foo := Q(Ep~ ) (11.13) 
where 
En — asp Epn 


is the Galois module of all p-power division points of E. Let G := 
Gal(F../Q), which by a theorem of Serre is an open subgroup of GL2(Z,). 
By the Weil pairing, K°° = Q("p~) is a subfield of F,,, and we put 
H := Gal(F/K°"°). The module Y,(£/F.) is again defined as in (11.12). 
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Recall that an Artin representation is said to be orthogonal if the underly- 
ing vector space carries a G-invariant, non-degenerate, symmetric bilinear 
form. The following theorem is a particular case of a more general result 
proved in [4, 86]. 


Theorem 11.4. Assume that E admits an isogeny of degree p over Q, 
where p is a prime of potential good ordinary reduction. Assume further 
that Y,(E/F0) is a finitely generated A(H)-module. Further, assume that 
the image of G in PGL2(F,) has even order. Then the p-parity conjecture 
holds for any self-dual, irreducible, orthogonal Artin representation p of G 
of dimension greater than 1. 


It is intriguing to note that the proof of this theorem given in [4] is 
uncannily parallel to Rohrlich’s computation of the root numbers w(£, p) in 
this case [20], perhaps suggesting an as yet undiscovered deeper connection 
between Iwasawa theory and local root numbers. 

Finally T. and V. Dokchitser (see [8]) have proven the following more 
general result by completely different methods, which do not involve Iwa- 
sawa theory. 


Theorem 11.5. Let E/Q be an elliptic curve with semi-stable reduction at 
2 and 3, and let p be any prime. Let K be a finite Galois extension of Q 
such that the p-Sylow subgroup of Gal(/Q) is normal and has abelian quo- 
tient. Then the p-parity conjecture holds for the prime p, and all orthogonal 
representations of Gal(K/Q). 


11.5. Examples and Applications 


In this final section, we give some applications of the p-parity conjecture 
to obtain lower bounds on the Zp-ranks of dual Selmer groups, and discuss 
several numerical examples. It is also worth noting that Iwasawa theory 
can be used to give upper bounds for the Z,-coranks of the p-components 
of the Selmer group, as p varies over the irreducible Artin characters of a 
p-adic Lie extension. In this spirit, we end by stating a joint conjecture with 
J. Coates that proposes strong upper bounds for the multiplicities of Artin 
representations which can occur in the dual Selmer group of an elliptic 
curve, considered over finite extensions within a p-adic Lie extension. 

Let E/Q be an elliptic curve and write Ng for the conductor of FE. Let 
p be an odd prime. For each integer n > 1, let F;, be the fixed field of the 
centre of Gal(Q(Ep» )/Q)). The following result is a special case of a more 
general result of [15, Corollary 2.5]. 
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Theorem 11.6. Assume that E has good ordinary reduction at p and a 
rational prime of order p. Suppose further that every prime of bad reduction 


of E has odd order in F>, and that —Np is not a square mod p. Then there 


exists a positive rational number c independent of n such that for every 
n> 1, we have 


2n 
8y,.E/F, 2 CP: 


As stated above, Iwasawa theory provides lower bounds for the coranks 
of the Selmer groups. Particularly striking is the case of the False Tate 
extension (see Definition 11.4), when the module Y,(E/F..) (see (11.12)) 
has A(#)-rank 1 where H is as in (11.11). We remark that there are many 
numerical examples where this is the case. We then have the following 
theorem [4, Theorem 4.8], where we recall that K = Q(Up). 


Theorem 11.7. Assume Y(E/F,,) is a finitely generated A(H)-module of 
A(#)-rank 1. Then for alln > 1, we have 


Sp,E/Ln = + Sy,B/Q)  8p,E/F, = P —1+ Sp,z/K; 
where the fields F,, and Ly are as in (11.10). 
A numerical example of this theorem is given by the elliptic curve 
EB: yity=22-2’, 
and for the prime p = 3 with m = 11. We deduce from the theorem that 
§3.B/L, =", and s3n/r,=3"—1, forn>1. 


Similarly, the assertions of the theorem hold for p = 7 and m = 2, 
showing that 


87,E/In =, and s7n/r,=7", forn>1. 


Here we have used the fact that s, g/K = 1. 
There is also a lower bound in the case of the GL» extension PF, as 
defined in (11.13). 


Theorem 11.8. Let E be an elliptic curve without complex multiplication 
and p a prime of potential good ordinary reduction. Let 


Fy = F(Ep~), Fn = Q(Epn). 
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In addition to the hypotheses of Theorem 11.4, assume that p = 3 mod 4. 
Then there exists c > 0 independent of n, such that 


Snr, 2 ep” (ee 1). 


As a numerical example of both Theorem 11.4 and Theorem 11.8, take 
E to be the elliptic curve 


y? + ary = g-2-1, 


of conductor Ng = 2.3.77, with p = 7, and F, = Q(E7~). Then E achieves 
good ordinary reduction at the unique prime of Q(j17) above 7. Moreover, 17 
is a Galois submodule of E7, and so EF has an isogeny of degree 7 defined 
over Q. It can be shown that X7(E/F.,.) is a finitely generated A(H)- 
module where H = Gal(F/Q(t7~)). Further, the image in this case of 
G in PGL2(F,) has order 42. Hence all the hypotheses of Theorems 11.4 
and 11.8 hold in this example. We remark that Rohrlich has shown that 
the cases w(E,p) = +1 and w(E,p) = —1 both occur for infinitely many 
self-dual irreducible Artin representations p of G. 

We end with a conjecture proposed jointly with John Coates, which 
was suggested by Theorem 4.12 of [4]. Let F, be a Galois extension of Q 
which is unramified outside a finite set of primes, and whose Galois group 
G is p-adic Lie group. We assume that F contains the cyclotomic Z,- 
extension Q°Y° of Q. Let ¥ denote the set of all one dimensional characters 
of Gal(Q°’°/Q), and E be any elliptic curve defined over Q. 


Conjecture 11.2. Assume E has potential good ordinary reduction at p. 
Then there exists an integer C’, depending only on E and F5, such that, 
for all irreducible Artin representations p of G, we have 


Sy Spbpx SC. (11.14) 


Naturally, there is another version of the above conjecture in which we 
replace Sp.f.9y in (11.14) by rz,,,. Of course, the generalised Birch and 
Swinnerton-Dyer conjecture (11.7), would imply the equivalence of the two 
versions. We note finally that Theorem 4.12 of [4] shows that the first 
version of the above conjecture holds for the False Tate extension. Both 
forms are true for the cyclotomic Z, extension of Q by virtue of well-known 
theorems of Kato and Rohrlich. At present, it is completely unknown for the 
extension F,, = F (Ep) when E does not admit complex multiplication. 
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Chapter 12 


von Neumann Algebras and Ergodic Theory 


V.S. Sunder 
The Institute of Mathematical Sciences, 
C. I. T. Campus, Chennai 6001138, India 
sunder@imsc.res.in 


There has been a long-standing and strong link between ergodic the- 
ory and von Neumann algebras (in particular, factors) dating back to the 
seminal work (cf. [18]) of Murray and von Neumann, specifically their con- 
struction of the first examples of factors of type II and type III. The bridge 
is provided by the celebrated group-measure space construction (or the 
crossed-product construction in modern parlance). In this survey, we shall 
commence with a discussion of some aspects of the magnificent edifice cre- 
ated by Murray and von Neumann, Dye, Krieger, Connes, Ornstein, Weiss, 
Feldman, Moore, ..., and conclude with an attempt* to describe some 
“rigidity” results of Gaboriau and Popa. 

We commence proceedings with brief introductions to each of the topics 
von Neumann algeras, ergodic theory, the group-measure space construction 
and JI, factors. 


von Neumann algebras 

A von Neumann algebra is a self-adjoint (ie., 7 € M => a* € M) 
unital (i.e., 1 € M) subalgebra M of the *-algebra B(#) of all continuous 
linear operators on a Hilbert space” H, which satisfies any of the following 
equivalent requirements: © 


‘It is only natural that the picture portrayed here is coloured/flawed by the author’s 
own perceptions/limitations of exposure, and it is almost sure that there have been 
many grave omissions, for all of which only the author’s limitations are to blame, and 
the author apologises for any such errors or omissions. 

‘All our Hilbert spaces will be assumed to be separable. 

“The equivalence of these three conditions — two topological, one algebraic — is von 
Neumann’s celebrated double commutant theorem. 
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(1) M is closed in the strong operator topology — ie., 7; € M,x € 
B(A),||(ai — x)€|| -0VE €§ H>axEeM 

(2) M is closed in the weak operator topology — ie., x € M,a € 
B(A), (a; — 2)€,n)0VE,n Ee H>exEeM 

(3) M"(= (M')’) = M, where S’ = {x € B(H) : zs = sxVs € S} denotes 
the commutant of S. 


The prototypical example of an abelian von Neumann algebra is given 
by the algebra A = L°(X,B, w) of essentially bounded measurable func- 
tions on a standard probability space (X, B, 1), viewed as a subalgebra of 
B(L?(X)) via f-€ = feVf € A,€ € L?(X). At the other extreme from 
an abelian von Neumann algebra is a factor, i.e., a von Neumann algebra 
whose center MM M’ reduces to the scalar operators C. 

It was recognised early that an important component to a von Neumann 
algebra is the set P(M) = {p € M : p= p* = p} of its projections. Just as 
all measurable functions can be approximated by simple functions, it is true 
that the linear subspace spanned by P(M) is norm-dense in M. Two pro- 
jections p,q are said to be (Murray-von Neumann) equivalent “rel M” — 
denoted by p ~y q — if there exists a u € M such that u*u = p, uu* = q. 
It turns out that MM is a factor if and only if any two projections are “com- 
parable” in the sense that one is equivalent to a sub-projection of the other. 
Murray and von Neumann initially classified factors into types I (there ex- 
ists a minimal projection), II (there do not exist minimal projections, but 
there do exist non-zero projections which are finite meaning they are not 
equivalent to any strictly smaller sub-projection) and III (there do not exist 
non-zero finite projections). 

(The material in this section first appeared in the papers of von Neu- 
mann, either singly authored or co-authored with Murray: see [18].) 


Ergodic theory 

Ergodic theory deals with the study of transformations T’ on a measure 
space (X,B,,) — which we will always assume is a complete standard 
probability space; the map T’ is usually assumed to be bijective mod yp, 
bimeasurable and non-singular — i.e., there are p-null sets N,,.N2 such 
that T maps X \ N, 1-1 onto X \ No, and E € B © T(E) € B and 
u(T~'(E)) =0 = p(E) = 0. A countable group TI of such transformations 
y is said to act ergodically if it satisfies any of the following equivalent 
conditions: 


(1) u(y (E)AE) = Vy €T => w(E) =0orl 
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(2) f=fowwy,7eTr = f is constant a.e. 
(3) E,F € B,y(E) > 0,pu(F) > 0=> 3y €T such that u(FNy(£)) > 0. 


Group-measure space construction 

Suppose [ is a countable group of non-singular transformations of a 
standard Borel space (X, 6) , equipped with a o-finite measure ps. Let H = 
(?(T, L?(X, B, 1)); the equations 


(a(F)EN() = (F o-EC) 
(A(70)E)(7) = El") 
respectively define a *-algebra representation of A = L™°(X,B,,) into 


B(#) and a unitary representation of I into B(H), and these represen- 
tations satisfy the commutation relation 


My)a(f) = (Ff oy *)A) (12.1) 
The von Neumann algebra M = (X(T) Ux(A))” generated by these two 
representations is denoted by A xT and called the crossed product of A 
with I’. Suppose the group T acts freely: i.e., for each y 4 1 in T, the set of 
points fixed by y is assumed to be a pi-null set. Then, we have the following 
beautiful result due to von Neumann (cf. [18] or [16]): 


Theorem 12.1. AxT is a factor if and only if T acts ergodically. Further, 
in this case: 


(1) The following conditions are equivalent: 
(i) is atomic; 
(iit) M = A™xT has a minimal projection 
In this case, M is a factor of type I. Further M is said to be a factor 
of type In,n < co if w admits precisely n mutually disjoint atoms. 

(2) The following conditions are equivalent: 
(i) uw has no atoms, but there exists a o-finite measure v which is mu- 
tually absolutely continuous with pu, which is invariant under T (i.e., 
yoy =vwvy); 
(ii) M is type IT 
In this case, 1 is a finite projection in M precisely when v is a finite 
measure. 

(3) M is type III if and only if there is no o-finite measure v which is 
mutually absolutely continuous with uw, which is invariant under [. 


Thus, we have our first examples of factors of type JJ — both type I, 
(which is type JJ with 1 being a finite projection) and type IJ, (which 
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is type IJ with 1 not being a finite projection) — and type J/I from the 
following examples of groups [' acting ergodically on Lebesgue spaces: 


e (1I,) T = Z acting on ($1, Bs, #-d0) via ne? = e?™(Ono) with a 
being irrational. 

e (JJ,.) T = Q acting on (R, Br, dx) via translation (r.2 = r+ 2x) 

e (111) T=QxQ acting on (R, Br, m = dz) via (b,a).2 = ax +b. (The 
point here is that I does not preserve the measure m, while the proper 
subgroup [9 = {(b,1)} CT preserves m and itself acts ergodically, and 
such a group [' cannot admit any o-finite equivalent invariant measure. ) 


IT, factors 

Note that the only finite factors are the factors of type I,,n < oo 
or of type J/,. It is a fact that a factor M is of finite type if and only 
if it admits a trace, i.e., a linear functional tr such that tr(1) = 1 and 
tr(zy) = tr(yx) Vz,y € M and tr(a*x) > 0 Va € M; further, such a 
trace is automatically faithful (0 4 « € M = tr(a*x) > 0) and unique. 
A type I,, factor is isomorphic to the full matrix algebra /,,(C), and the 
corresponding “tr” is nothing but the usual matrix trace normalised by a 
factor of 1/n. On the other hand JJ, factors are infinite-dimensional, but 
their “finiteness” results in many pleasant features. 

What is also true of a finite factor is that if p,q € P(M), then p ~yy 
q = tr(p) = tr(q). While the set {tr(p) : p € P(M)} is nothing but 
{k/n :0< k < n} in the I, case, it turns out to be [0,1] in the I; 
case. A Hilbert space equipped with a normal (= appropriately continuous) 
*_representation of a IJ, factor M is referred to as an M-module. It turns 
out (as a perfect parallel with the case of J, factors) that a module H 
over a IJ, factor M is classified, up to M-linear isomorphism, by a number 
dimyH (which can be any number in [0, 00]), the so-called von Neumann 
dimension as an M-module. 

If A : [—?(1) denotes the left-regular representation of a countable 
group [’, then the equation 


iris) = @L.1) 


" where 1 


defines a faithful trace on the von Neumann algebra LT = A(T) 
denotes the standard basis vector indexed by the identity element of ; and 
LT is a IT, factor if and only if the conjugacy class of every y # 1 is infinite 


(the ICC condition). 
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Almost all the material, so far, in this section, is from the seminal work 
of von Neumann ([18]). Some of the details, in slightly more modern termi- 
nology, may also be found in [16], for instance. 


Two questions: 


(1) What pairs of algebras (M, A) arise in the above manner? 
(2) When do two ergodic dynamical systems (X;, Bj, ui,0;),4 = 1,2 yield 
isomorphic pairs (M;, A;) as above? 


The first question, or rather, a near relative (where one considers more 
general crossed-products twisted by a 2-cocycle) has been answered very 
satisfactorily in [9], and the answer turns out to be: precisely when A is a 
Cartan subalgebra of M — meaning that it has the following properties: 


e A is a maximal abelian von Neumann subalgebra of M; 

e The normaliser Ny (A) = {u € U(M) : uAu* = A} (where U(M) = 
{ue M:u*u = uu* = 1} is the unitary group of W) generates M as 
a von Neumann algebra: i.e., M = Ny7(A)”; and 

e there exists a faithful conditional expectation of M onto A. 


We shall say no more about the first question, since our concern is 
primarily with the second, whose answer turns out to be: 


if and only if the two actions are orbit equivalent 


The notion of orbit (or weak-) equivalence (see definition below) was 
introduced (and the validity of the answer established) in the measure- 
preserving context by Dye (cf. [5], [6]) and studied (and the validity of the 
answer established) in the non-singular case by Krieger (cf. [12], [13]). 

Before getting to the pertinent definitions, we first make two blanket 
assumptions for the remainder of this paper. 

All our measure spaces (X,B, 1) will henceforth be assumed to be com- 
plete standard probability spaces equipped with a non-atomic probability 
measure; “isomorphisms between such triples are bijective (mod null sets), 
bimeasurable measure preserving transformations”. 


Definition 12.1. 


(1) An isomorphism between two spaces (Xj, By, i) and (X2, Bo, 2) isa 
bijective bimeasurable map ¢ : X1 \ Ni—X2 \ No, for j;-null sets Nj, 
such that 0 ¢7! = po. 
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(2) A dynamical system is a tuple (X,B,,a,0) where [ is a countable 
group, and a: Tl—Aut(X, B, 1) is a homomorphism of groups. 

(3) Two dynamical systems (X;, B;, u;,a;,0;),i = 1,2 are conjugate if there 
exists an isomorphism ¢ : X,;— X2 such that a2(T2) = ¢da1(T1)¢"+. 

(4) Two dynamical systems (X;, B;, 4;,a;,0;),i = 1,2 are orbit equivalent 
if there exists an isomorphism ¢ : X,;—X2 such that ¢(ai(fi)2) = 
a2(C2)¢(x) for wi- aa x. 


Every dynamical system (X,B,,a,T) gives rise to an equivalence re- 
lation — which we shall denote by Rp or Ra — which is the Borel subset 
of X x X given by {(x,a(y)(x)) : a € X,y € T}. This equivalence relation 
has countable equivalence classes. In fact, a result of [8] shows that any 
such standard equivalence relation (with countable classes) arises as orbit 
equivalence defined by a countable group [ acting as Borel isomorphisms of 
(X, B) — although not necessarily freely according to a result of Furman. 

Question 2 above may be viewed as asking when two dynamical systems 
are orbit equivalent — i.e., when is there a Borel isomorphism f : X,;—X2 
such that (f x f)(Ra,) = Raz. Dye showed ([5]) that any two ergodic 
actions of Z are so isomorphic. A volume of work by several people (notably 
Dye, Connes, Feldman, Krieger, Vershik, ...) culminated in the following 
beautiful result proved by Ornstein and Weiss (cf. [14], see also [4]). 


Theorem 12.2. (Ornstein-Weiss) Jf [; and T2 are infinite amenable 
groups, every ergodic action of [, is orbit equivalent to every ergodic action 
of T. 

Equivalence relations obtained from such actions of such groups are 
characterised by the following property of hyperfiniteness: 

there exists a sequence of standard equivalence relations Ry», on X with 
finite equivalence classes such that 


Ten C hg Ot ond Ts = Up 


Thus Rp remembers neither I nor a if T is an infinte amenable group 
and a is an ergodic action. On the other hand, at the other end of the spec- 
trum, many people (Zimmer, Furman, Gaboriau, and later Popa, Monod, 
Ozawa, ...) have obtained “rigidity results” which say something like this: 
if Ra, are orbit equivalent, then under some conditions on the T;, these 
two dynamical systems must actually be conjugate! (For an example, see 
Popa’s strong rigidity theorems (cf. [17]), which say something like this: 

Certain kinds of free ergodic actions of certain kinds of groups G are 
such that if the resulting equivalence relation R has the property that the 
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‘induced relation’ Ry (defined in the third paragraph below) is isomorphic to 
Rr for some Borel subset Y and some free ergodic action of some countable 
group T, then Y must have full measure, and the actions of T and G must 
be conjugate through a group isomorphism. 

It follows that for a relation R as in this strong rigidity theorem, the 
restriction Ry to a Borel subset with 0 < u(Y) < 1 can never be obtained 
from a free ergodic action of any countable group I’, thus furnishing another 
proof of Furman’s result mentioned earlier. 

The key notions used in Gaboriau’s work are stable orbit equivalence, 
measurable equivalence and ¢?-Betti numbers, upon which we now briefly 
dwell. 

It is well known that if the action is ergodic, then the “space of orbits” 
(= the quotient of X by the relation of being in the same orbit) does 
not have a “good Borel structure”, i.e., is not standard. The space R is 
a good substitute. Now, if A is a Borel subset of positive measure in X, 
then A meets almost every orbit, so by the philosophy expressed in the 
previous sentence, the induced relation Ra := RM(A x A) is an equally 
good description of the “space of orbits”. Let us call ergodic equivalence 
relations R; on standard probability spaces (X;, B;, w;) (for i = 1,2) stably 
orbit equivalent (or simply SOE) if there exist Borel subsets A; € B; of 
positive measure, a positive constant c and a Borel isomorphism f : A; Ao2 
such that ju2 0 f = cfy on A, and (f x f)(Ra,) = Ra,; and c is called the 
compression constant of the SOE. 

On the other hand, call two countable groups [;,2 = 1,2 measurably 
equivalent (or simply ME) if they admit commuting free actions on a stan- 


dard measure space (X,B, 1)? which admit a fundamental domain F; of 
LF 2) 
ee) 
The two notions of equivalence defined in the preceding paragraphs turn 


finite measure; call the ratio the compression constant of the ME. 
out to be closely related, and we have the following result, proved originally 
by Furman (cf. [7], [10]): 


Theorem 12.3. [, is ME to lg with compression constant c if and only 
if 1, and 12 admit free actions on standard probability space such that the 
associated equivalence relations are SOE with compression constant c. 


Now, we briefly discuss ¢?-Betti numbers. These were first introduced 
by Atiyah in the context of actions of countable groups on manifolds with 
compact quotients; he relied on the von Neumann dimension dimppH, of 
the Hilbert space of harmonic L?-forms of degree n, which has the structure 


4Here the measure is allowed to be infinite (but should be o-finite). 
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of a module over the von Neumann algebra LT (generated by the regular 
representation of I’). This was later considerably extended by Cheeger and 
Gromov, who studied actions of countable groups on general topological 
spaces, and succeeded in defining the sequence {3,,(I°)} of ¢?-Betti numbers 
of any countable group. 

Next, Gaboriau defined the ¢?-Betti numbers 6,,(R) of any standard 
equivalence relation with invariant measure. He was helped in this by the 
work of Feldman and Moore, where a von Neumann algebra LR with a finite 
faithful normal trace had been naturally associated to a standard equiva- 
lence relation with invariant probability measure. (If R = Rp for an ergodic 
action preserving a probability measure space (X, B, jz), then LF is just the 
IT, factor given by the crossed product construction.) Gaboriau considers a 
universal R-simplicial compler ER and essentially observes that the space 
of (?-chains has a natural structure of an LR-module, defines 3,(R) as the 
LR-dimension of the corresponding reduced ¢?-homology groups of ER, 
and proves: 


Theorem 12.4. (Gaboriau) If an equivalence relation R is produced by a 
free action of 1 which preserves a probability measure, then 


Gaboriau goes on to prove that the ratio of corresponding ¢?-Betti num- 
bers of two ME groups agrees with the compression constant of the ME. 

Thus we find that if free actions of countable groups [; yield equiva- 
lence relations R;,7 = 1,2 which are orbit equivalent, and hence SOE with 
compression constant 1, then the groups I; must be ME with compression 
constant 1. 

Coming back to orbit equivalence, we deduce the following fact from 
the foregoing discussion: 

The ¢?-Betti numbers of orbit equivalent free actions are equal. 

The simplest example of groups in the same ME class is furnished by 
any two lattices, not necessarily co-compact, of a locally compact second 
countable group (as seen by their actions by left-, resp., right- multipli- 
cations on the ambient group). Gaboriau obtains many rigidity results, a 
sample being: 


Corollary 12.1. (Gaboriau) 


(1) No lattice in SP(n,1) is ME to a lattice in SP(p,1) ifn F p. 
(2) No lattice in SU(n,1) is ME to a lattice in SU(p,1) ifn F p. 
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(3) No lattice in SO(2n,1) is ME to a lattice in SO(2p,1) ifn # p. 
Proof. It is known from the work of Borel (cf. [2]) that 
BATS PUM, 1)) 40 St = 2 


B(T(SU(m,1)) F40ei=m 
B(T(SO(2m, 1)) #0 i= m 
where we write [(G) to denote any lattice in G. O 


Finally, we should mention that Gaboriau’s results have been used 
ingeniously by Sorin Popa to settle a long-standing conjecture of 
Kadison’s — regarding the existence of [7 factors with trivial fundamental 
group. 

If M is a IT, factor, there is a natural definition of the so-called 
amplification Mqg(M) (or the d x d matrix algebra over W/) where d is any 
positive real number. For instance, it may be identified with the (JZ, factor 
Endy (Ha) of) M-linear operators on the M-module Hq with dimyHa = d. 
von Neumann already realised the importance of the object, called the fun- 
damental group’ F(M) of M, and defined by 


F(M) ={d>0:M 2 Ma(M)}. 


Popa showed that there are many examples of J7; factors of the form LRa 
(arising from free ergodic actions a of suitable ICC groups) which do indeed 
have trivial fundamental group. An example of such an action is the natural 
action of SL(2,Z) on T?. In fact, Gaboriau and Popa have even shown 
(cf. [11]) that (each finitely generated non-abelian free group) F,, admits 
uncountably many free ergodic actions a; preserving a probability measure, 
which are pairwise not SOE, such that LR., has trivial fundamental group 
for each 7. Much more of the subsequent exciting developments, as well as 
pertinent literature, may be found in the article [17] by Vaes. 
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We discuss the Segal-Bargmann transform associated to the Laplacian on 
the Heisenberg group. Using analogues of Gutzmer’s formula for Hermite 
and special Hermite expansions, we obtain several characterisations of 
the image of L?(H”) under the Segal-Bargmann transform. 


13.1. Introduction 


In 1888 August Gutzmer ({5]) published a formula which in modern nota- 
tion amounts to the following: If F’ is a 27-periodic holomorphic function 
on the complex plane, then for every y € R we have 


CO 


27 
| |F(a+iy)?de = S > |F(k)|?e~?¥ 
0 


k=—oo 


under suitable assumptions on the Fourier coefficients F (k) of the restric- 
tion of F' to the real line R. The above is just Parseval’s identity applied to 
F(x) = F(a + iy). Integrating the above identity over R with respect to 
e- 2 dy we obtain 

2r (oe) 
| | |F (a + iy)Pe“ 24 dady = S~ |F(k) Pe". 
R JO 


k=—0co 


*Dedicated to 222 A. 
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If F = f «p, where f € L?(S') and p; is the heat kernel defined by the 
series 


pr (a) = » ook teika 


—Cco 


then the above becomes 
21 <i 20 
[fo expos iPe tardy =e f° |f(e)Pae 
RJO 0 


which is a characterisation of the image of L?($1) under the Segal- 
Bargmann transform which takes f into the holomorphic function F(a + 
iy) = f * p(w + iy). 

More generally, let G be a Lie group, A a (non-negative) Laplacian on 
G and p;,t > 0 the heat kernel associated to A. Given f € L?(G), the 
function f * pz, initially defined on G extends holomorphically to certain 
G-—invariant domain 2 contained in the complexification Gc of G. This 
transform, taking f into the holomorphic extension of f * p;, is known as 
the Segal-Bargmann transform and is of interest to mathematical physicists. 
It is also called the heat kernel transform for obvious reasons. 

This transform was studied by Bargmann ({1]) (and by Segal indepen- 
dently) when G = R”. They showed that the image of L?(IR") under this 
transform is a weighted Bergman space- namely, the space of all entire func- 
tions on C” which are square integrable with respect to a positive weight 
function w;. To be precise, they showed that 


[ Papo malady ey i, f(a)Pae. 
R2n Rn 


In 1994 similar results were obtained for all compact Lie groups by Hall ([6]) 
and later in 1999 Stenzel ([11]) treated the case of Riemannian symmetric 
spaces of compact type. 

In the Euclidean set up, with the standard heat kernel 


pe(x) = (Art) ~"/2e~ ae ll? 


the result of Segal and Bargmann can be described as follows. A function 
F on R” can be factored as F' = f * p; if and only if it can be extended to 
C” as an entire function and 


/ |F (a + iy) |?py2(y)dady < co. 
R2n 
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Another way of saying the same is that 


[Pee pyalu dud < oo 


The equivalence follows from the fact that 


| IF (e + ty) Pde = | F(©)Pe-2"Eae 
n R” 


which is the analogue of Gutzmer’s formula for the Fourier transform. Note 
that it is again a simple consequence of the Plancherel theorem. 

In 1978 M. Lassalle ({9], [10]) established analogues of Gutzmer’s for- 
mula for compact Lie groups and compact symmetric spaces. His formulas 
were used by Faraut in [4] to study Segal-Bargmann transform on com- 
pact symmetric spaces. The works of Hall ({6]) and Stenzel ({11]) did not 
use Lassalle’s formula but as can be seen from Faraut ([4]) and the recent 
work [14] of the author it is clear that the use of Gutzmer’s formula makes 
the proofs simple and transparent. In [2], [3] Faraut proved an analogue of 
Gutzmer’s formula for non-compact symmetric spaces which was used by 
Krotz, Olafsson and Stanton ([8]) to study Segal-Bargmann transform. 

The investigations of Krotz et al. ([8]) have revealed that the results 
which are true for compact symmetric spaces are no longer true in general. 
In the case of non-compact Riemannian symmetric spaces X = G/K, the 
solution of the heat equation f * p; associated to the Laplace-Beltrami 
operator does not extend to the complexification X¢ but only to a domain = 
called the complex crown. Even then, the image is not a weighted Bergman 
space; this was shown in the work of Krétz et al. [8]. Another surprising 
case is that of the Heisenberg group H”. For the full Laplacian on H” it 
was shown in Krétz, Thangavelu and Xu ((7]) that the image of L?(H") 
is not a weighted Bergman space in the usual sense. However, the image 
turned out to be a direct sum of two Bergman spaces defined in terms of 
oscillating weight functions. 

In this article we restrict ourselves to the Segal-Bargmann transform on 
the Heisenberg group. We obtain several characterisations - some are new 
and some already known- using various analogues of Gutzmer’s formula. 

As a motivation, return to R” and consider the two conditions char- 
acterising the image of L?(IR") under the Segal-Bargmann transform men- 
tioned above. The first is in terms of holomorphic properties of F’ whereas 
the second one transfers the condition on the Fourier transform side. In 
view of Gutzmer’s formula, both are equivalent. Let us look at the second 
condition more carefully. For each € € R” we have an irreducible unitary 
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representation e¢ realised on C. In terms of this the second condition takes 
the following form: the function e¢(x)F'(€) = F(€)e*”’> extends to C” as an 
entire function and 


I. [. lee(a + iy) F(€)|’prjaly)dydé < 00. 


This restatement of the characterisation motivates the following consider- 
ation. 

Given a Lie group G let G stand for its unitary dual with Plancherel 
measure dji(7). For f € L?(G) and 7 € G let a(f) be its Fourier transform 
which is a Hilbert-Schmidt operator on 7,, the Hilbert space on which 7 
is realised. Then we have another representation of G, denoted by the same 
symbol z and realised on Sg, the Hilbert space of Hilbert-Schmidt operators 
on H,. This is defined by g — m(g)T,T € So. Unlike the abelian case, it 
is not true that this representation can be extended to Ge for all T € So. 
However, under further assumptions on J it might have a holomorphic 
extension. For example, when T = 7(F’), F = f * pz we hope that we get an 
operator valued holomorphic function on Gc. Then the main problem for 
Segal-Bargmann transform reduces to characterise functions F' that can be 
written as f * p, in terms of properties of the above function. 

Taking the clue from the Euclidean case, instead of looking for a weight 
function w; on Gc so that 


* pe(z)|?wi(z)dz = . 
[rem (2)d [ees 


we can look for a weight function w;(z,7) on Gc x G such that 


Lf lle )a(F + Polirste(=, mdadu(a )= f lrtoPag 


In other words, we ask if the image can be identified with a direct integral 
of (operator valued) weighted Bergman spaces. In the case of Heisenberg 
group, we already know from the work [7] that the image is a direct integral 
of twisted Bergman spaces. The new approach leads to another such result 
where the fibres are operator valued weighted Bergman spaces (see Theorem 
2.0): 

As we have noted above, both approaches lead to the same result for 
the Euclidean spaces. Using Gutzmer’s formula, which is due to Lassalle 
({9]), we can also prove the same (i.e., the equivalence of two approaches) 
for all compact symmetric spaces (and hence for all compact Lie groups). 
Our investigations with non-compact symmetric spaces does not lead to 
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anything new. In this respect Heisenberg group seems to be special. We 
believe similar characterisations are true for all nilpotent Lie groups. 


13.2. Segal-Bargmann Transform on the Heisenberg Group 
The Heisenberg group H” is just R” x R” x R with the group law given by 
1 
(x, u, €)(2',u',€) =(a@+a'utul, sau —u-a')+€+€). 


Here x-u = )0_, vjuj, as usual, denotes the standard pairing on R”. On 
HH!” we consider the (2n + 1) vector fileds 


1 1 
Xi = Oz; a 5 506, U; = Ou; - 5259 Z = 0¢,j = 1,2,...,0 


forming a basis for the Heisenberg Lie algebra. Recall that [X;,U;] = Z, 
all other brackets being zero. We consider the full Laplacian 


n 
_ 2 2 2 
A=—S°(X?+U}?)-Z 
j=l 
which is a non-negative elliptic differential operator which generates a dif- 
fusion semigroup. The heat kernel associated to this semigroup is explicitly 


given by 


Pr(a, U,€) = cn f 


R 


=tr€ tr? A " —1+)(coth td)(a-x+u-u) 
a. ™ (<rx) a uA 


with c, = (47)~” (see [7] for a proof). 
In the sequal we use the notation 
Pau) = fe F(a, u, gas 
R 


for a function f on H”. In this notation we have 


3 2 
pe(aru,8) = f ee p(a,u) dd 
R 
where 


A m4 

r = —7A(coth tA) (w-au-u) | 
Bye) = Ge (= a) : 

From the above representation it is clear that p;(x,u,&) extends to the 

complexification C” x C” x C of H” as an entire function p;(z, w, ¢). There- 

fore, for any f € L?(H") the convolution f * p; also extends as an entire 

function. This map, which takes f into f*p;(z,w, ¢) is the Segal-Bargmann 


214 S. Thangavelu 


transform or heat kernel transform for the Heisenberg group. Our problem 
is to characterise functions F in L?(H") that can be factored as f * p; for 
some f € L?(H"). 

As f * p, extends to an entire function on C” x C” x C it is natural to 
ask if there is an analogue of Bargmann’s result, namely, does there exist a 
non-negative weight function W;(z, w, ¢) such that F’ on H” can be factored 
as F = f «p:,f € L?(H”) if and only if F extends to C” x C” x C as an 
entire function which is square integrable with respect to W;. In [7] the 
following result was proved, answering the above question in the negative. 


Theorem 13.1. The image of L?(H") is not a weighted Bergman space. 


Our first characterisation is in terms of holomorphic properties of 
F*(x,u). We consider the weight function 


WP (Z, w) a a aa (2y, 2v) 


where z = a2+iy,w =u+tiv. Let BA(C?”) be the Hilbert space of entire 
functions G on C2” which are square integrable with respect to W). We 
equip B)(C?”) with the norm 


Kea - [. IG(z, w) |? WP (z, w)dzdw. 


These spaces are called twisted Bergman spaces. We then have the following 
result. 


Theorem 13.2. A function F € L?(H"”) can be factored as F = f * p; if 
and only if for every X #0, F*(x,u) extends to an element of B\(C?”) and 


fe LPI a < 00. 


This result, proved in [7], shows that the image of L?(H”) under the 
Segal-Bargmann transform is a direct integral of twisted Bergman spaces. 
Moreover, it was proved in [7] that we have 


[eM ira = [i io Pag 
R : H” 


where f is the function related to F via F = f * p;. The above result is in 
terms of the Fourier transform of F in the central variable. Our next result 
is in terms of the group Fourier transform of F’. 

In order to define the group Fourier transform of a function f on H” we 
need to recall the relevant representations of H”. We use the Schrodinger 
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picture. For every non-zero real A we have an irreducible unitary represen- 
tation 7) of H” which is realised on L?(R”). It is explicitly given by 


T(x, u, E)p(v) = ef eiMEE+EE o(y + u) 


for y € L?(IR"). By a theorem of Stone and von Neumann any irreducible 
unitary representation of H” which is nontrivial at the center is unitarily 
equivalent to some 7). Using these representations we define the Fourier 
transform of a function f € L‘(HI”) as the operator valued function 


fa) =f Fla)ma(a)as 


The Plancherel theorem says that the above definition can be extended to 
all f € L?(H"), f(A) is Hilbert-Schmidt for every A and we have 


| If (9) Pag = f FON Zrsd(d) 
H” R 


where du(A) = (27)~" “1 |A|"dA is the Plancherel measure. 

The Fourier transform of the heat kernel can be explicitly calcu- 
lated in terms of the Hermite operators H(\) = —A + A?|a|?. In fact 
pA) = ee tH) where e~t#) is the Hermite semigroup generated 
by H(A). The Hilbert space L?(IR") has an orthonormal basis consisting of 
the Hermite functions @4,a@ € N” which are eigenfunctions of H(A) with 
eigenvalues (2|a|+n)|\|. The image of L?(R”) under the Hermite semigroup 
e—'4(\) has been characterised as a weighted Bergman space, see [16] for a 
proof. Let 


Us @)= 2” (sinh(4tr))~"/2e tanh(2tA)|x|?—A coth(2ta)|y|? 


and define H}(C") to be the Hilbert space of entire functions on C” which 
are square integrable with respect to UA(z)dz. Then it is known that an 
entire function ® on C” belongs to H)(C”) if and only if 6 = e~#Vy 
for some y € L?(R”). This space HA(C”) is called the Hermite Bergman 
space. 


Theorem 13.3. A function F € L?(H") can be factored as F = f * p, if 
and only if for every \ 4 0, F(A)* maps L?(R") into H}(C”) in such a 
way that 


eo be \F9" Rg du() < 00. 


aeEN” 
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This result is not difficult to prove. The first assumption on F’ (A) means, 
in view of the characterisation of H}(C”), that there is an operator valued 
function Ty such that F (A)* = e t#O)T,. The second assumption shows 
that 7) is actually Hilbert-Schmidt and 


[ T,||2e2 du(A) < 00 


Hence, we can find f € L?(H”) such that f(A) = et T* and hence F’ = fx 
pz follows. The other implication is even simpler. The non trivial ingredient 
in the proof is the realisation of H)(C”) as the image of L?(IR”) under 
e 'H\)_ This can be proved using an analogue of Gutzmer’s formula for 
the Hermite expansions (see Section 3). 

We can define another family of representations of HI” realised on So, 
the Hilbert space of all Hilbert-Schmidt operators on L?(R"), by setting 
(g)T = 7y(g)T for T € Sp. This is clearly a unitary representation of H”. 
When T is an arbitrary element of Sg we may not be able to holomorphically 
continue the S2 valued function (x, u,€) > 7)(a,u,€)T. It turns out that 
such an extension is possible precisely when JT can be factored as T = 
SetH). This leads us to the following characterisation of the image of 
L?(H”) under the Segal-Bargmann transform. We let CA(C?2",S2) stand 
for the Sg-valued weighted Bergman space consisting of entire functions 
G(z,w) on C?” for which 


IGI2s =f ICGuiv)Irs phy, 2v)dydo <ce. 


Let Cfo(C?”,S2) be the subspace of C)A(C?”,S2) consisting of functions 
G(z,w) satisfying G(z, w) = 7)(z, w, 0)G(0, 0). 


Theorem 13.4. A function F € L?(H") can be factored as F = f * p, if 
and only if for every \ #0, 7(a,u,0)F(A) can be extended to C?” as an 
element of CAg(C™, 53) which satisfies 


[ve AC, OPOA)Bg, 2” dua) < 00. 


By defining w)(y, v, 7) = po, (2y, Qu)e~ 27 and dv(y,v,) = dydudn we 
can rewrite the above theorem as follows. 


Theorem 13.5. A function F € L?(H") can be factored as F = f * pt 
if and only if for every X # 0, 7)(x,u,€)F(A) can be extended as entire 
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function on C?"*! which satisfies 


[ f... Ir ( 19,00, in)F ae ‘i rg we (y, v, n)dv(y, v, n)dp(rA) < <0o. 


When it happens, the above integral is a constant multiple of || f \|3. 


A proof of this theorem requires an analogue of Gutzmer’s formula for 
the Hermite expansions which will be stated in the next section. Similarly, 
Theorem 2.2 can be proved using Gutzmer’s formula for special Hermite 
expansions. We can view the Heisenberg group as the homogeneous space 
G,,/K where G,, is the Heisenberg motion group and K = U(n). We there- 
fore have a Gutzmer’s formula for entire functions on the complexification 
C?"+1 of H!” which has been used in proving a Paley-Wiener theorem for 
the inverse Fourier transform on HI”, see [15]. We can also use it to prove the 
main result in [7], namely, the image of L?(H”) under the Segal-Bargmann 
transform is the direct sum of two weighted Bergman spaces defined via 
oscillating weight functions. 


13.3. Gutzmer Formulas and Their Applications 


The aim of this section is to state several analogues of Gutzmer’s formula 
relevant to the Heisenberg group and use them to obtain various character- 
isations of the image of L?(H”) under the Segal-Bargmann transform. We 
begin with an analogue of Gutzmer’s formula for the Hermite expansions. 
Recall that the spectral decomposition of the Hermite operator H(A) = 

—A + \?|2|? is given by 

[o.e) 

= $0 (2k + n)|A|Pe(A) 

k=0 


where P,(A) is the orthogonal projection of L?(IR”) onto the eigenspace 
spanned by {®2 : |a| = k}. Here ® are the normalised Hermite functions 
on R” scaled by 4. For various facts about Hermite functions we refer to 
[12]. The following result, which is an analogue of Gutzmer’s formula for 
the Hermite expansions, has been established in [16]. 


Theorem 13.6. Let f € L?(R") has a holomorphic extension F to C”. 
Then for any z=x+iy,w=utiv eC” we have 


[. [ roc F@Paoas 
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eeyco Rin 1)! — 
_ pA(u-y—v:-z) rd 2 
=e 7] +n py Pa (Ry, 260) Pi(A) Fle 


under some further assumptions on the function f. 


In the above theorem y?(y,v) stands for the Laguerre function 
ee 1 _21 21 yy? 
LENIN? + v2))e +), 


Note that we have used the notation y? in place of |y|? so that p(y, v) 
can be holomorphically extended to yr (z, w). The group K appearing in 
the above formula is S'p(n, R) 1 O(2n, R), the intersection of the symplectic 
group and the orthogonal group. It has a natural action on R” x R” which 
extends to C” x C”. The formula is valid if we assume, say, f = e~*#)g for 
some function g € L?(IR”). As an immediate corollary of the above formula 
we obtain the following result mentioned earlier. 


Theorem 13.7. An entire function F belongs to HA(C”) if and only if 
F =e-t4™g for some g € L?(R"). 


For a proof of this theorem we refer to [16]. The only other ingredient 
needed is given in the following lemma proved in [13]. 


Lemma 13.1. 


— (k+n-—1)! P 
I. vp (2iy, 2iv)po,(2y, 2v)dydv = com cde hal 
We are now in a position give a sketch of the proof of Theorem 2.4. 
First of all we need to show that 7)(x,u,0)F(A)* extends to C?” as an So 
valued entire function. If F = f * p; we are required to see if 


ty G (th. oe 4) F(ay*T*) 


extends as an entire function for every T’ € Sg. This amounts to check, in 
view of inversion formula for the Weyl transform, if p *) g, is an entire 
function where *) is the twisted convolution and g) is given by the relation 
m(gx) = f(A)*T*. But this is the case since p} is a Gaussian. 

We can calculate the Sz norm of 7, (z, w,0)e"*#? f(A)* using the Her- 
mite basis {®2 : a € N”}. Using Gutzmer’s formula for the Hermite expan- 
sions and some properties of the special Hermite functions it is not hard to 
show that 


| la (k-(z, w), Oe! F(A)" [Begdk 
K 
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co 
a i el e-2(2k+n)|alt_Ki(m = LL 


= py (iy Biv) IF xa GR lld- 


k=0 
Setting 2 = u = 0 and integrating the above with respect to p},(2y, 2v)dydu, 
we get 


[2 lax OFM: du) = en f I @Pas. 


This proves one half of the theorem. 

To prove the converse, suppose that F € L?(H") satisfies the hypothesis 
of the theorem. Then it is easy to see that it also satisfies the hypothesis of 
Theorem 2.3. Indeed, it follows from Gutzmer’s formula that 


lady ef siya PB v,2e)dyde 


for every g € H}. Hence we can appeal to Theorem 2.3 to conclude that 
F = f xp; for some f € L?(H”). 

Thus Gutzmer’s formula for the Hermite expansions is the main ingre- 
dient in the proof of Theorem 2.4 and Theorem 2.5 follows immediately. 
We now show that Theorem 2.2 can be proved by using another Gutzmer 
formula, namely the one for special Hermite expansions. 


Theorem 13.8. Assume that the entire function f satisfies ||f *, pp|l2 < 
ue 
Ce~(2k+n)IAl)?t for allt > 0. Then we have 


| / |f((x, u) + ta(y, v)) Perl), °C Ol dadadu 
R2n JU(n) 


k\(n — 1)! oo. 
=n. ee Pela Ey Ty Pe ie: 2iv). 
k=0 : 


In the above theorem, [(x,w),(y,v)] stands for the symplectic form 
(u-y—2-v) on R?”. For a proof of this formula we refer to [13]. From 
this and Lemma 3.3 it follows that an entire function F € B, if and only 
if F = f +, p) for some f € L?(C”). Theorem 2.2 is proved by observing 
that (f *p;)* = et” fA +) p>. 

We conclude this section with another Gutzmer formula for the full 
Heisenberg group. Let G,, be the Heisenberg motion group which is the 
semidirect product of H” with the unitary group U(n). This group has a 
natural action on H” which extends to all entire functions on C?"*?, 
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Theorem 13.9. Under suitable hypothesis on an entire function f on 
C2"+! we have the following identity: 


i If(g.(z,w,0)) Pag 
Gn 


= _d\May—vey ( kl(n — 1)! a 
= a ie (> If * PRE TE mm a Oe C2 2) dyi(A) 
“2 k=0 , 


where ||f> *, plo is the L?(C”) norm of f* * yp. 


We remark that the above theorem is valid for all those functions for 
which the right hand side of the above identity is finite. For example, when 
the Fourier transform of the restriction of f to H” is compactly supported 
(in a suitable sense) the theorem remains true. We refer to [13] for a proof of 
this theorem and also for the hypotheses under which it is true. The proof 
uses some calculations on special Hermite functions and a general theorem 
proved by Faraut ([{2]). A similar result is valid for the semidirect product of 
the reduced Heisenberg group with U(n) from which Gutzmer’s formula for 
the special Hermite expansions can be deduced. Gutzmer’s formula for the 
Heisenberg group has been used in [13] to prove a different characterisation 
of the image of L?(H") under the Segal-Bargmann transform. A version of 
Paley-Wiener theorem for the inverse Fourier transform also follows from 
Gutzmer’s formula. 
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My talk is a survey on finite translation generalized quadrangles. To 
each translation generalized quadrangle of order (s,t), with s 4 1 t, 
there corresponds a set O(n,m,q) of gq” + 1 (n — 1)-dimensional sub- 
spaces of the projective space PG(2n+m-— 1, q) satisfying (i) every three 
subspaces generate a PG(3n — 1,q) and (ii) for every such subspace 7 
there is a subspace PG(n + m — 1, q) containing 7 and having empty in- 
tersection with the other elements of O(n,m,q). Conversely, every such 
O(n,m,q) defines a finite translation generalized quadrangle. For each 
known example of O(n,m,q) we have m € {n,2n}, and for q even there 
are no other examples. Many papers were written on the case m = 2n. 
Here emphasis is on the case m = n, and besides interesting and useful 
old results several new theorems are stated. 


14.1. Finite Generalized Quadrangles 


14.1.1. Finite generalized quadrangles 


A (finite) generalized quadrangle (GQ) is an incidence structure S 


(P, B,1) in which P and B are disjoint (nonempty) sets of objects called 
points and lines respectively, and for which I is a symmetric point-line in- 


cidence relation satisfying the following axioms. 


(i) Each point is incident with 1 +t lines (t > 1) and two distinct points 


are incident with at most one line. 


(ii) Each line is incident with 1+ s points (s > 1) and two distinct lines 


are incident with at most one point. 


(iii) Ifa is a point and L is a line not incident with x, then there is a unique 


pair (y,M)€ Px B for whichea I MIyIL. 
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Generalized quadrangles were introduced by Tits ([42]) in the appendix 
of his celebrated work on triality. 

The integers s and t are the parameters of the GQ and S is said to 
have order (s,t); if s = t, S is said to have order s. There is a point-line 
duality for GQs (of order (s,t)) for which in any definition or theorem the 
words “point” and “line” are interchanged and the parameters s and ¢ are 
interchanged. Hence, we assume without further notice that the dual of a 
given theorem or definition has also been given. 

If the point x is not incident with the line L, in a GQ S = (P,B,]), 
then we write x Tf L. 

Let S = (P, B,1) be a (finite) GQ of order (s,t). Then S has v = |P| = 
(1+ s)(1+ st) points and b = |B| = (14+ t)(1 + st) lines; see 1.2.1 of [24]. 
Also, s +t divides st(1 + s)(1+¢), and, for s 41+ t, we have t < s? and, 
dually, s < t? (inequalities of Higman, (({13])); see 1.2.2 and 1.2.3 of [24]. 

Given two (not necessarily distinct) points x,y of S, we write x ~ y 
and say that x and y are collinear, provided that there is some line L for 
which x I LI y. And « ~ y means that x and y are not collinear. Dually, 
for L,M € B, we write L ~ M or L & M according as L and M are 
concurrent or nonconcurrent, respectively. The line which is incident with 
distinct collinear points x,y is denoted by xy; the point which is incident 
with distinct concurrent lines L, M is denoted by either DM or LOM. 

For « € P, put c+ ={ye P||y ~ 2}, and note that x € zt. 

A subquadrangle, or also subGQ, S’ = (P’, B’,I’) of a GQ S = (P,B,1) 
of order (s,t), is a GQ for which P’ C P,B’ C B, and where I’ is the 
restriction of I to (P’ x B’) U (B’ x P’). If the GQ S’ of order (s’,t’) is 
a subGQ of the GQ S of order (s,t), with S 4 S’, then either s = s’ or 
s > s’t’, and, dually, either ¢ = t’ or t > s’t’; see 2.2.1 of [24]. 

Let S = (P,B,1) be a GQ. A collineation or automorphism of S is a 
permutation of P UB that preserves P, B and incidence. 


14.1.2. Grids and dual grids 


A grid is an incidence structure S = (P, B,I) with P = {a;; ||i=0,1,..., 51 
and 7 = 04 505 ey Si >Oand s2>0, B= (iis Lin ss 05 dtey, Mio Migass y 
M,,},%i;1L, if and only if i =k, and 2,;1M;, if and only if 7 = k. We say 
that such a grid is an (s; + 1) x (sg + 1)-grid. A grid is a GQ if and only if 
$1 = 89 = s; in such a case the GQ has order (s,1). Any GQ of order (s, 1) 
is isomorphic to an (s+ 1) x (s+ 1)-grid. A dual grid has parameters ¢1, to, 
and it is a GQ if and only if t; = tg = t, in which case it is a GQ of order 
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(1,t). Any GQ of order (1, #) is isomorphic to a dual (¢ + 1) x (¢ + 1)-grid. 
An ordinary quadrangle is a GQ of order (1,1) and is at the same time 
a grid and a dual grid. This is the motivation for the term “generalized 
quadrangle”. 

A GQ of order (s,t) is called thin if either s = 1 or t = 1; in the other 
case it is called thick. 


14.1.3. The classical generalized quadrangles 


We now give a brief description of three families of examples known as the 
classical GQs, all of which are associated with classical groups and were 
first recognized as GQs by Tits. 

(i) Consider a nonsingular quadric Q of projective index 1, that is, of 
Witt index 2, of the projective space PG(d,q), with d = 3,4 or 5. Then 
the points of Q together with the lines of Q (which are the subspaces of 
maximal dimension on Q) form a GQ Q(d,q) with parameters 


s=q,t=1, v=(q+1)’, b= 2(q+1), when d = 3, 
s=9,1—¢, v =(q4+1)(q74+1), b= (¢+1)(¢ +1), when d = 4, 
s=qgt=@,v=(¢+1)(P41),b=(P +1)(¢ +1), when d=5. 


Notice that Q(3,q) is a grid. 
(ii) Let H be a nonsingular Hermitian variety of the projective space 


PG(d, q*), d = 3 or 4. Then the points of H together with the lines on H 
form a GQ H(d,q?) with parameters 


s=q,t=q, v=(P?+1)(P +1), b=(a+1)(q? +1), whend=3, 
s=(7,t=@,v=(P4+1)(¢ 41), b=(P4+1)(¢ +1), when d=4. 


(iii) The points of PG(3, q), together with the totally isotropic lines with 
respect to a symplectic polarity, form a GQ W(q) with parameters 


s=q,t=q,,v=(¢+1)(¢? +1), b=(¢+1(@* +1). 
Theorem 14.1. 


(i) The GQ Q(4,q) is isomorphic to the dual of W(q). Also, Q(4,q) (or 
W(q)) is self-dual if and only if q is even. 
(ii) The GQ Q(5,q) is isomorphic to the dual of H(3, q). 


For a proof of Theorem 14.1, see 3.2.1 and 3.2.3 of [24]. 
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14.1.4. Ovals, hyperovals and ovoids 


A k-arc of PG(2,q) is a set of k points of PG(2,q) no three of which are 
collinear. Then clearly k < q+ 2. By [5], for qg odd, k < q+1. Further, any 
nonsingular conic of PG(2,q) is a (q+ 1)-are. It can be shown that each 
(q+ 1)-arc K of PG(2,q), q even, extends to a (q+ 2)-arc K U {x} (see, 
e.g., [15], p. 177); the point x, which is uniquely defined by K, is called the 
kernel or nucleus of kK. The (q+ 1)-ares of PG(2,q) are called ovals; the 
(q + 2)-ares of PG(2,q), q even, are called hyperovals. 

For any k-arc K, with 3 < k < q+ 1, choose three of its points as the 
triangle of reference ugu uz of the coordinate system. The lines intersecting 
K in one point are called the tangent lines of kK. A tangent line of K through 
one of uo, U1, U2 has respective equation 


Ride =0,.o> GH =0,35 520 So, 


with d # 0. We call d the coordinate of such a line. Suppose the t = q+2—k 
tangent lines at each of uo, ui, u2 are 


X4 = aj; Xo = 0, X2 = b; Xo => 0, Xo = ci X1 = 0, 
i=1,2,...,t. Then Segre ((27]) proved the following important lemma. 


Lemma 14.1. (Lemma of Tangents) The coordinates a;,b;,c; of the 
tangent lines at uo, U1, U2 of a k-arc K through these points satisfy 


t 
TE, ap bic =-1. 


For an oval kK we have t = 1, and so the lemma becomes abc = —1. 
Geometrically this means that for q odd the triangles formed by three 
points of an oval and the tangent lines at these points, are in perspective, or, 
equivalently, that for any three distinct points uo, u,, v2 on an oval there is 
a (unique) nonsingular conic containing these points uo, ui, U2 and having 
as tangent lines at uo, u1, U2 the tangent lines of the oval at uo, ui, U2; for g 
even the condition means that the tangent lines at any three points of the 
oval are concurrent. 

Relying on Lemma 14.1, Segre ([27]) proves the following celebrated 
result. 


Theorem 14.2. In PG(2,q), q odd, every oval is a nonsingular conic. 


For q even, Theorem 14.2 is valid if and only if q € {2,4}; see, e.g., [35]. 
We now introduce the notion of “ovoid” as defined in [43]. An ovoid O 
of PG(3,q) is a set of points of PG(3,q), no three of which are collinear 
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and such that for any point of O the union of the lines which meet O only 
in that point, that is, the tangent lines at that point, is a PG(2,q). If O is 
an ovoid, its number of points is gq? +1. A k-cap K of PG(3,q) is a set of 
k, points no three of which are collinear. For any k-cap K of PG(3,q), with 
q#2,k < q?+1; for any k-cap K of PG(3, 2), k < 8 holds and the 8-caps of 
PG(3, 2) are the complements of planes. For q odd this result is due to Bose 
((5]), for g even to Qvist ([26]). A (q?+1)-cap of PG(3,¢q), q # 2, is precisely 
an ovoid (cf. [3] or [14]); the ovoids of PG(3,2) are the sets of 5 points no 
4 of which are coplanar. It is easy to show that each nonsingular elliptic 
quadric of PG(3, qg) is an ovoid. By a celebrated theorem, due independently 
to Barlotti ((3]) and Panella ([19]), every ovoid in PG(3,q),q odd or q = 4, 
is an elliptic quadric. 


Theorem 14.3. Each ovoid of PG(3,q), q odd, is an elliptic quadric. 


To the contrary, in the even case, Tits ((43]) showed that for any gq = 
22e+1 with e > 1, there exists an ovoid which is not an elliptic quadric; these 
ovoids are called Tits ovoids, or also Suzuki-Tits ovoids, and are related to 
the simple Suzuki groups Sz(q). In fact, for gq = 8, Segre ([28]) discovered 
an ovoid which is not an elliptic quadric, and which was shown to be a Tits 
ovoid in [12]. For even g no other ovoids than the elliptic quadrics and the 
Tits ovoids are known. 

If O is an ovoid in PG(3,q), then any plane 7 of PG(3,q) intersects O 
in either one point or in an oval. If |tM O| = 1, then we say that 7 is a 
tangent plane of O. At each of its points O has exactly one tangent plane. 
For more details, see [14]. Finally, a beautiful result due to Brown ((6]) tells 
us that any ovoid O of PG(3,q) containing at least one conic section, is an 
elliptic quadric. 


14.1.5. The generalized quadrangles T2(O) and T3(O) of Tits 


Let d = 2 (respectively, d = 3) and let O be an oval (respectively, an ovoid) 
of PG(d,q). Further, let PG(d,q) = H be embedded as a hyperplane in 
PG(d+1,q) = P. Define 


e Points as 
(i) the points of P\H, 
(ii) the hyperplanes X of P for which |X 9 O| = 1, and 


(iii) one new symbol (oo). 
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e Lines as 
(a) the lines of P which are not contained in H and meet O (neces- 
sarily in a unique point), and 
(b) the points of O. 

e Incidence is defined as follows. A point of Type (i) is incident only 
with lines of Type (a); here the incidence is that of P. A point of Type 
(ii) is incident with all lines of Type (a) contained in it and with the 
unique element of O in it. The point (oo) is incident with no line of 
Type (a) and all lines of Type (b). 


It is an easy exercise to show that the incidence structure T>(O) (re- 
spectively, T3(O)) so defined is a GQ. The parameters are 


s=qgt=q, v=(qt)I(P+1),b=(+1)(@ +1), whend=2, 
s=qgt=@,v=(¢qt+1)(P4+1),b6=(F+)(P +1), when d=3. 


Theorem 14.4. 


(i) The GQ T2(O) is isomorphic to the GQ Q(4,q) if and only if O is a 
nonsingular conic. The GQ T2(O) is isomorphic to the GQ W(q) if 
and only if q 1s even and O 1s a conic. 

(ii) The GQ T3(O) is isomorphic to the GQ Q(5,q) if and only if O is a 
nonsingular elliptic quadric. 


For a proof of Theorem 14.4, see 3.2.2 and 3.2.4 of [24]. 


Remark 14.1. 


(i) For gq odd any oval is a nonsingular conic, hence for g odd we always 
have T2(O) ~ Q(4, q). 

(ii) For gq odd any ovoid is an elliptic quadric, hence for gq odd we always 
have T3(O) = Q(5, q). 


14.1.6. The generalized quadrangles TZ (O) 


Let O be a hyperoval of PG(2,q), so q is even. Embed PG(2,q) = H asa 
hyperplane in PG(3, q) = P. 


e The points of the GQ T3(O) are the points of P\H, 
e lines of the GQ are the lines of P not in H which meet O, and 
e the incidence is inherited from P. 
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Then T3(O) is a GQ with parameters 
s=q-lLt=q+1lwe=¢,b=(¢+2)¢. 


14.1.7. Orders of the known generalized quadrangles 


The orders of the known GQs are 


(5,1), s E N\{0}, 
(1, ¢), t € N\{0}, 

(4,9); q any prime power, 
(q,9°); q any prime power, 
(q?, qd); q any prime power, 
(a a), q any prime power, 
(a .0"), q any prime power, 


(q—1,q¢+1), q any prime power, 
(¢+1,q—-1), q any prime power. 


14.1.8. Generalized quadrangles with small parameters 


The proofs of all the results in this section are contained in Chapter 6 of 
[24]. 

Let S be a GQ of order (s,t),1 < 5s <t. If s = 2, then t € {2,4} and 
GQs of order (2,2) and (2,4) are unique. If s = 3, then t € {3,5,6,9}. GQs 
of order (3,5) and (3,9) are unique; up to isomorphism there are two GQs 
of order 3. If s = 4, then t € {4,6,8,11,12,16}. There is just one GQ of 
order 4, and for t € {6,8, 16} a unique example is known. Nothing is known 
about ¢ = 11 and ¢= 12. 


14.2. Translation Generalized Quadrangles 


14.2.1. Translation generalized quadrangles 


For proofs of all results in this section we refer to Chapter 8 of [24] or 
Chapter 3 of [41]. 

Let S = (P,B,1) be a finite generalized quadrangle. A whorl about a 
point p of the GQ S is a collineation fixing p linewise. A whorl about a line 
L is a collineation fixing L pointwise. An elation about the point p € P is 
a whorl about p that fixes no point of P \ p+. Dually, one defines elations 
about lines. If @ is an elation about p, then we will often say that p is the 
center of 0. By definition, the identical permutation is an elation (about 
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every point and every line). If p is a point of the GQ S, for which there 
exists a group of elations G about p which acts sharply transitively on the 
points of P \ pt, then S is said to be an elation generalized quadrangle 
(EGQ) with base-point or center or elation point p and elation group (or 
base-group) G, and we sometimes write (S®),G) or S) for S. Dually, we 
define the base-line of an EGQ. We only work with EGQs that are thick, 
and therefore we will not bother each time to mention this. 

If a GQ (S®),G) is an EGQ with elation point p, and if G is abelian, 
then we say that S is a translation generalized quadrangle (TGQ) with base- 
point or translation point or center p and translation group (or base-group) 
G. The elements of a translation group are called translations. It can be 
shown that the translation group G is uniquely defined by the translation 
point p. 


Theorem 14.5. For any TGQ of order (s,t) we have t > s. 


14.2.2. The kernel of a translation generalized quadrangle 


Suppose (S), G) is a TGQ of order (s,t), s 4 1¥ t, with translation point 
p and translation group G, and let y be a point of P\ pt. Let Lo, L1,..., Lt 
be the lines incident with p, and define r; and M; by D,Ir;jIM;ily, 0 <i < t. 
Put H; = {0 € G|| M? = M,}, H* = {0 € G||r? =r;}, with O <i <t, 
then |H;| = s and |H;*| = st. The kernel K of S®) (or of (S®),G)) is the 
set of all endomorphisms a of G for which H® C H;, 0 <i <t. With the 
usual addition and multiplication of endomorphisms K is a ring. 


Theorem 14.6. The ring K is a field, so that H® = H;,,(H*)* = H* for 
alli =0,1,...,¢ and alla € K® = K\{0}. 


For each subfield F' of K there is a vector space (G, F’) whose vectors 
are the elements of G, and whose scalars are the elements of F’. Vector 
addition is the group operation in G, and scalar multiplication is defined 
by ga = g*,g € G,a € F. It is easy to verify that (G, F’) is indeed a vector 
space. As H; is a subspace of (G, F’), we have |H;| > |F'|. It follows that 
s > |K|. There is an interesting corollary of (G, F’) being a vector space. 


Theorem 14.7. The group G is elementary abelian, so s and t must be 
powers of the same prime. 


Finally we have the following result. 
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Theorem 14.8. The multiplicative group K° induces the group of all 
whorls about p and y, with p & y. 


14.2.3. T(n,m,q)s and translation generalized quadrangles 


In this section, we introduce the notion of T(n,m,q), which is a natural 
generalization of the Ta(O) constructions of Tits, d € {2,3}. 

Suppose H = PG(2n+m-1, q) is the finite projective (2n+m-—1)-space 
over GF(q). Now define a set O = O(n, m, q) of subspaces as follows: O is a 
set of q’” +1 (n—1)-dimensional subspaces of H, denoted by PG (n—1,q), 
and often also by 7;, so that 


(i) every three generate a PG(3n — 1, q); 

(ii) for every i=0,1,...,q™, there is a subspace PG (n +m —1,q), also 
denoted by 7;, of H of dimension n +m — 1, which contains PG (n — 
1,q) and which is disjoint from any PG) (n — 1,q) if 7 i. 


If O satisfies these conditions for n = m, then O is called a pseudo-oval 
or a generalized oval or an [n—1]-oval of PG(3n—1,q). A [0]|-oval of PG(2, q) 
is just an oval of PG(2,q). For n 4 m, O(n,m,q) is called a pseudo-ovoid 
or a generalized ovoid or an [n — 1]-ovoid or an egg of PG(2n+ m-—1,q). A 
[(0]-ovoid of PG(3, q) is just an ovoid of PG(3, q). 

The space PG (n + m — 1,q) is the tangent space of O(n,m,q) at 
PG (n—1,q); it is uniquely determined by O(n,m, q) and PG (n—1,q). 
Sometimes we will call an O(n,n,q) also an “egg” or a “generalized ovoid” 
for the sake of convenience. 

From any egg O = O(n, m, q) arises a GQ T (n,m, q) = T(O) which is a 
TGQ of order (q",q'™) for some base-point (co). This goes as follows. Let 
H be embedded in a PG(2n +m, q) = H". 


e The points are of three types. 


(i) The points of H’ \ H. 
(ii) The subspaces PG(n + m,q) of H’ which intersect H in a 
PGO(n +m —1,q). 
(iii) A symbol (co). 


e The lines are of two types. 


(a) The subspaces PG(n,q) of PG(2n + m,q) which intersect H in 
an element of the egg. 
(b) The elements of the egg O(n, m, q). 
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e Incidence is defined as follows. The point (co) is incident with all the 
lines of Type (b) and with no other lines. A point of Type (ii) is 
incident with the unique line of Type (b) contained in it and with all 
the lines of Type (a) contained in it. Finally, a point of Type (i) is 
incident with the lines of Type (a) containing it. 


Conversely, by interpreting any TGQ in terms of the vector space (G, F’), 
defined in Section 14.2.2, it can be seen that any TGQ is a T(n,m,q) 
associated to some O(n, m, q). 


Theorem 14.9. The geometry T(n,m,q) is a TGQ of order (q",q'") with 
translation point (co) and for which GF(q) is a subfield of the kernel. More- 
over, the translations of T(n,m,q) induce translations of the affine space 
AG(2n+m, q) = PG(2n+m, q)\PG(2n+m-—1,q). Conversely, every TGQ 
for which GF(q) is a subfield of the kernel is isomorphic to a T(n,m,q). 
It follows that the theory of TGQs is “equivalent” to the theory of the sets 
O(n, m, q). 


Remark 14.2. 


(a) We emphasize that with the group of all homologies of PG(2n + m, q) 
having center y not in PG(2n + m — 1,q) and axis PG(2n + m — 1,q) 
corresponds in a natural way the multiplicative group of a subfield of 
the kernel. 

(b) It is clear that T(1,1,q) is a T2(O) of Tits, and that T(1,2,q) is a 
T3(O) of Tits. 

(c) If S = T(O), with O = O(n,m,q), then for fixed n,m, q, we have that 
O is uniquely defined by S. 


Corollary 14.1. For any O(n,m,q) we have n <_m < 2n. 


Corollary 14.2. Let S‘) be a TGQ of order (s,t), s a prime. Then either 
S@) = Q(4,s) or S® = Q(5,s). 


Theorem 14.10. Let (S,G) be a TGQ of order (s,t). Then S@ & 
Ta(O) for some d € {2,3}, if and only if for a fixed point y, y # x, the 
group of all whorls about x fixing y has order s — 1, that is, if and only if 
|| =; 
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14.2.4. Regular pseudo-ovals and regular pseudo-ovoids 


In the extension PG(2n+m-—1, q”) of PG(2n+m—1,q), with m € {n, 2n}, 
we consider n (@ + 1)-dimensional spaces PGO (2 +1,q") = &, with 
i = 1,2,...,n, which are conjugate with respect to the extension GF(q") 
of GF(q), that is, which form an orbit of the Galois group corresponding to 
this extension, and which span PG(2n+m-—1,q"). In &; we consider an oval 


O; for m = n and an ovoid O; for m = 2n. Let O; = {2 a), ee ae 


Further, let oh) ooh), with i = 0,1,...,q’, be conjugate with 
respect to the extension GF(q”) of GF(q). The points al???) see a” 
define an (n—1)-dimensional space PG (n—1, q) = 7; over GF(q), with i = 
0,1,...,q™”. Then O = {70,71,..., qm} is a generalized oval of PG(3n — 
1,q) for m = n, and a generalized ovoid of PG(4n — 1, q) for m = 2n. Here, 
we speak of a regular or elementary pseudo-oval, respectively a regular or 
elementary pseudo-ovoid. If O; is a conic, then we speak of a pseudo-conic 
or a Classical pseudo-oval; if O; is an elliptic quadric, then we speak of a 
classical pseudo-ovoid. 

If m =n, then in the regular case T(n,n, q) = T2(O1); if m = 2n, then 
in the regular case T(n, 2n, q) = T3(O1). 

For m = n each known pseudo-oval is regular. Also for m = 2n with q 
even each known pseudo-ovoid is regular. For m = 2n with gq odd nonregular 
pseudo-ovoids are known, see Section 14.4. 


14.3. Important Properties of O(n,m, q) 


For details and proofs we refer again to Chapter 8 of [24] and to Chapter 3 
of [41]. We remark that pseudo-ovals were already introduced in 1971; see 


[32]. 


14.3.1. Properties of O(n,m,q) 


In this section properties of O(n, m,q), for any n and m, will be stated. Let 
PG(2n +m — 1,q) be the projective space containing O(n, m, q). 


Theorem 14.11. The following hold for any O(n,m,q). 


(i) Each hyperplane of PG(2n + m—1,q) which does not contain a tan- 
gent space of O(n,m,q), contains either 0 or 1+ q'” elements of 
O(n,m,q). If m = 2n, then each hyperplane of PG(4n — 1,q) which 


does not contain a tangent space of O(n,2n,q) contains exactly 1+ q” 
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elements of O(n,2n,q). If m 4 2n, then there are hyperplanes which 
contain no element of O(n,m,q). 
(ii) Eithern =m or n(a+1)=ma with a € No and a odd. 


Corollary 14.3. Let O be the union of all elements of any O(n, 2n,q) and 
let x be any hyperplane of PG(4n — 1,q). Then |ON r| € {71,72}, with 
ya = (q™ — 1)(¢2""! +1)/(q—1) and — 72 = @2"—!. Hence O defines a 
linear projective two-weight code and a strongly regular graph. 


Remark 14.3. For the construction of a linear projective two-weight code 
and a strongly regular graph from a pointset of PG(r,q) with two inter- 
section numbers with respect to hyperplanes of PG(r,q), we refer, e.g., to 
[8]. 


14.3.2. Properties of pseudo-ovals 


In particular, O(n,n,q) has the following properties. 
Theorem 14.12. Let O = O(n,n,q) be a pseudo-oval in PG(3n — 1, ¢q). 


(i) If q is even, then all tangent spaces of O contain a common (n — 1)- 
dimensional space, called the kernel or nucleus of O. 

(ii) If q is odd, then each point of PG(3n —1,q) not in an element of O is 
contained in either 0 or 2 tangent spaces of O, and each hyperplane of 
PG(3n—1,q) not containing a tangent space of O contains either 0 or 
2 elements of O. 


Theorem 14.13. For q odd, the tangent spaces of a pseudo-oval O = 
O(n,n,q) are the elements of a pseudo-oval O* = O*(n,n,q) in the dual 
space of PG(3n — 1,q). 


Definition 14.1. The pseudo-oval O* is called the translation dual of 
the pseudo-oval O. The TGQ T(O*) = T*(O) is also called the translation 
dual of the TGQ T(O). 


Remark 14.4. It is not known whether or not we have always O = O%*, 
respectively T(O) = T*(O). In any case, for each known T(O), with O = 
O(n,n,q) and q odd, we have T(O) = Q(4,q”). 


Theorem 14.14. The TGQ S of order s, with s odd, and its translation 
dual S* have isomorphic kernels. 
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14.3.3. Properties of eggs 


In this section we mention fundamental results on eggs. 


Theorem 14.15. Let O(n,m,q) be an egg, son#Am. 


(i) Each point of PG(2n+m—1,q) which is not contained in an element 
of O(n,m,q) belongs to either 0 or q™~" +1 tangent spaces of the 
egg. If m = 2n, then each point of PG(4n —1,q) not contained in an 
element of O(n,2n,q) belongs to exactly q” + 1 tangent spaces of the 
egg. If m 4 2n, then there are points contained in no tangent space of 
O(n,m, q). 

(ii) For q even we necessarily have m = 2n. Hence for any O(n,m,q) with 
q even, we have m € {n,2n}, that is, for any TGQ S of order (s,t), 
with s and/or t even, we have t € {s, 87}. 


Theorem 14.16. Every three distinct tangent spaces of O(n,m,q),m#n, 
have as intersection a space of dimension m—n-—1. 


By Theorem 14.16 the tangent spaces of O(n,m,q), with m # n, are 
the elements of an egg O*(n,m,q) in the dual space of PG(2n + m — 1,q). 


Definition 14.2. The tangent spaces of an egg O(n,m,q) in PG(2n + 
m —1,q) form an egg O*(n,m,q) in the dual space of PG(2n + m — 1, q). 
So in addition to the TGQ T(n,m,q), a TGQ T*(n,m,q) arises. The egg 
O*(n,m,q) = O* will be called the translation dual of O(n,m,q) = O, 
and T*(n,m,q) = T*(O) = T(O*) will be called the translation dual of 
the GQ T(n, mq) = T(Q). 


Remark 14.5. For regular eggs O, we clearly have O = O%*, respectively 
T(O) = T*(O). For q odd there are examples with O ¥ O*, respectively 
T(O) # T*(O); see Section 14.4. For g even we have O = O* for all known 
examples (all known examples are regular). 


Theorem 14.17. The TGQ S of order (s,t), with s £ t, and its translation 
dual S* have isomorphic kernels. 


14.4. Eggs O(n, 2n,q): Fundamental Results and Characterizations 


In this section we first mention two interesting characterizations of regular 
eggs; see [24]. For the other results we refer to [41]. 
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14.4.1. Characterizations of regular eggs O(n, 2n, q) 
Theorem 14.18. Let O(n, 2n,q) =O be an egg in PG(4n — 1, q). 


(i) The egg O is regular if and only if the q” +1 tangent spaces containing 
any given point z not contained in an element of O, have exactly (q” — 
1)/(q—1) points in common. 

(ii) The egg O is regular if and only if each PG(3n — 1,q) containing at 
least three elements of O, contains exactly q” +1 elements of O. 


14.4.2. Fundamental results on eggs 


This section contains fundamental results on eggs, all of which were discov- 
ered after the monograph by Payne and Thas was published. 

An egg O = O(n, 2n,q), respectively a TGQ T(O), is called good at an 
element 7 € O if for every two distinct elements 7’ and 7” of O \ {7} the 
(3n — 1)-dimensional space < 7,7’,7” > generated by a,7’,7” contains 
exactly q” +1 elements of O; in such a case 7 is called a good element of O. 


Theorem 14.19. ((34]) The egg O(n,2n,q), with q even, is good if and 
only if its translation dual O*(n,2n,q) is good. 


Theorem 14.20. ([34]) Let O be an egg in PG(4n — 1,¢),q odd, which is 
good at its element 7. Then the q?” + q” pseudo-ovals on O containing 7 
are classical. 


Remark 14.6. Relying on Theorem 14.20 a new infinite class of ovoids 
of Q(4,s) and a new infinite class of translation planes was discovered (an 
ovoid of Q(4, s) is a set of s? +1 points of Q(4,s) no two of which are on a 
common line of Q(4, s)); see [39]. 


For q even every known egg O(n,2n,q) is regular, so good at any 
of its elements; hence the corresponding TGQs are TGQs of Tits. For 
q odd, there are O(n,2n,q)s which are not regular. In fact four infinite 
classes of O(n, 2n,q)s, q odd, are known, and two sporadic examples: the 
classical eggs, an infinite class due to Kantor ([18]) (here O(n,2n,q) = 
O*(n,2n,q), O(n, 2n,q) is good at some element, and for each odd q to- 
gether with an automorphism o ¥ 1 of GF(q) there is a nonclassical exam- 
ple), an infinite class of O(n, 2n,q)s with q = 3” and h > 1, deduced from 
Ganley semifields (see [23]) (here O(n, 2n, q) # O*(n, 2n, q), O*(n, 2n, q) is 
good at some element, while O(n,2n,q) is not good at any element), the 
translation duals O*(n,2n,q) of the foregoing O(n, 2n,q)s (it was Payne 
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who discovered in 1989 that O(n, 2n, q) # O*(n,2n,q)), and finally an egg 
O(5, 10,3) together with its non-isomorphic translation dual O*(5, 10,3) 
(these two examples were deduced by Bader, Lunardon and Pinneri ({1}) 
from an ovoid of Q(4,3°) discovered by Penttila and Williams ([25]) (one 
of these eggs is good). 

Let F be a flock of the quadratic cone K in PG(3, q), that is, a partition 
of K minus its vertex into gq nonsingular conics. In 1976 it was shown by 
Walker ([44]), and independently by Thas, that F’ defines a translation plane 
P(F) of order q?. In 1987 Thas ([33]), relying on work of Kantor ([17,18]) 
and Payne (({21,22]), proved that with each flock F’ there corresponds a 
GQ S(F) of order (s?,s), called a flock GQ. The paper [33] was the origin 
of an explosion of interest in the theory of GQs and led to many results, 
also because of the fact that now certain GQs of order (s?,s) and certain 
translation planes of order s? were closely linked. 


Theorem 14.21. ([16]) A TGQ T(O) of order (s,s), s even, is the point- 
line dual of a flock GQ if and only if O (respectively, T(O)) is classical. 


Theorem 14.22. ((34,37]) The TGQ T(O) of order (s,s”), s odd, is the 
point-line dual of a flock GQ if and only if the translation dual O* of O is 
good at one of its elements. 


Remark 14.7. Theorem 14.22 is a particular case of a more general result 
of [37]. 


We also mention that there is a classification in [36] of good eggs of 
PG(4n — 1,q), q odd, in terms of Veronese surfaces and their projections. 

In Section 14.1.4 we mentioned the beautiful theorem of Brown on ovoids 
in PG(3, q), with q even. The next theorem generalizes this result to pseudo- 
ovoids. 


Theorem 14.23. ({7]) Let O(n,2n,q) be an egg in PG(4n — 1,q),q even. 
Then O(n, 2n,q) is classical if and only if it contains a pseudo-conic. 


Other interesting characterization theorems are the following. 


Theorem 14.24. ([40]) Let O be an egg in PG(4n —1,q),q even, which is 
good at x € O. If the q?" +.q" pseudo-ovals on O containing m are regular, 
then O is regular. 


Theorem 14.25. ((4]) Assume that O(n,2n,q), q odd, is good and that 
q > 4n? —8n+2. Then either O(n, 2n,q) is classical or is an egg of Kantor 


type. 
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Remark 14.8. Many interesting and useful characterizations of eggs in- 
volving subquadrangles and automorphism groups can be found in [24] and 
[41]. 


14.5. Pseudo-Ovals: Old and New Results 


All known pseudo-ovals are regular; so for g odd all known pseudo-ovals are 
classical. Compared to eggs, not many results on pseudo-ovals are known. 
Here we collect some interesting results on these objects, including very 
recent ones. 


14.5.1. Old results 


Let O(n,n,q) = {70,71,-.-,7qn} be a pseudo-oval in PG(3n — 1,q). The 
tangent space of O(n,n,q) at 7; will be denoted by 7;, with i = 0,1,...,q”. 
Let PG(2n—1,q) C PG(3n—1,q) be skew to 7;, 7 € {0,1,...,q"}. Further, 
let 7,NPG(2n — 1,q) = &, < m7; > NPG(2n — 1,q) = &,7 #1. Then 
{€, €1,-.-,€qn } = 5; is an (n—1)-spread of PG(2n—1, q), that is, a partition 
of PG(2n — 1,q) consisting of g” + 1 (n — 1)-dimensional subspaces. 

Now let q be even and let 7 be the nucleus (cf. Section 14.3.2) of 
O(n, n, q). Let PG(2n—1, q) be skew to 7. If ¢; = PG(2n—1,¢)N < 9, 2; >, 
then {Co,¢1,.--, gn } = S is an (n — 1)-spread of PG(2n — 1,4). 

Let q be odd. Let 4 € {0,1,...,q"}. Put 7.97, = 6;,7 # 4. Then by 
Theorem 14.12, {7;, 50, ..., 6:-1, Oi41,---, gr} = S7 is an (n — 1)-spread of 
Ti. 


Theorem 14.26. ([9]) Consider a pseudo-oval O(n,n,q), with q odd. Then 
at least one of the (n—1)-spreads So, S1,..., Sqn, SG, ST,---,Sgn ts regular 
if and only if they are all regular if and only if O(n,n,q) is classical if and 


only if the corresponding GQ T(n,n,q) is isomorphic to the classical GQ 
Q(4,q”). 


Let O(n, n,q) = O be a generalized oval in PG(3n—1,q), with q even. Then 
O is a translation generalized oval, with axis the tangent space PG (2n — 
1,q) of O at PGM(n — 1,q) € O, if there is a group of involutions of 
PGL(3n,q) with axis PG)(2n — 1,q), fixing O and acting regularly on 
O\{PG™ (n—1, q)}. It can be shown that it is sufficient that there is a group 
of involutions of PGL(3n, q) fixing PG (n — 1,q) and acting regularly on 
the remaining elements of O. If n = 1, then a translation generalized oval is 
just called a translation oval. All translation ovals of PG(2,q),q = 2”, were 
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determined by Payne ([20]); choosing suitable coordinates, they are always 
of the form 


{(1, t,t?) || t € GF(q)} U {(0, 0, 1)}, 
where 7 is fixed in {1,2,...,h —1} and (h,i) = 1. 


Theorem 14.27. ((40]) Let O be a generalized oval in PG(3n —1,q), with 
q even. Then O is a translation generalized oval if and only if the point-line 


dual of T(O) is a TGQ. 
Remark 14.9. For n = 1 the theorem can be found in the monograph [24]. 


Let O = {m,71,..., qn} be a generalized oval in PG(3n—1, q), with g even, 
and let 7 be the nucleus of O. Further, let 7 be the tangent space of O at 
m. For each i € {1,2,...,q”}, the set 


10; Oy 4 < ae, ey Sir laze oh 


is an (n — 1)-spread of 7, denoted S;. If O is a translation generalized oval 
with axis T, then all the spreads S; coincide. 

Suppose that all the spreads 5; coincide. We say that O is projective at 
T if the following property holds : 

Let y be an element of S = S; (for alli),where st 4 y 4 n, and let j,k 
be in {1,2,...,q"}, J #k, such that < 7,1; >#< 7,1 >. As the spreads 
S; coincide, there are elements mT; and Te so that my C< y,7j; >, with 
9 9's ond tye Cm ym, > withk fk’. Then < 17, my > N= Ty > ts 
an element of S. 


Theorem 14.28. ({40]) Let O be a generalized oval in PG(3n — 1,q), 
even, and use the above notation. Then O is a translation generalized oval 
with axis T if and only if all the spreads S; coincide and O is projective at 
rs 


Let O = {m,71,...,7qn} be a generalized oval in PG(3n — 1,q), with 
q = 2", and let 7 be the tangent space of O at 7. Now we define a point-line 
incidence structure A(O) as follows 


e Points are the elements of O \ {7}; 
e Lines are the pairs {7;,2;} with 7,7 € {1,2,...,q"} andi Fj; 
e Incidence is containment. 
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Hence A(QO) is the complete graph with vertex set O\ {7}. Further, two 
lines {7;,7;} and {7,7} are called parallel if < 1,7; > NT =< m,,™ > 
Nr. 


Theorem 14.29. ({40]) The incidence structure A(O) provided with par- 
allelism is isomorphic to the hn-dimensional affine space AG(hn,2) over 
GF (2) if and only if O is a translation generalized oval with axis T. 


We also have the following characterization of classical generalized ovals 
in the even case. 


Theorem 14.30. ({40]) The pseudo-oval O(n,n,q), q even, is classical if 
and only if O(n,n,q) is 2-transitive. 


Relying on the previous theorem, Bamberg and Penttila ([2]) obtained the 
following stronger characterization. 


Theorem 14.31. ([2]) The pseudo-oval O(n,n,q), q even, is classical if 
and only if O(n,n,q) is transitive. 


14.5.2. New results 


In this section yet unpublished results on pseudo-ovals will be discussed; 
see [38]. 

Let 71,72,73 be mutually skew (n — 1)-dimensional subspaces of 
PG(3n — 1,q), let 7; be a (2n — 1)-dimensional space containing 7; but 
skew to 7; and zx, with {t,j,k} = {1,2,3}, and let 77; = mm with 
{i,j,k} = {1, 2,3}. The space generated by 7; and 7; will be denoted by ¢;, 
with 7 = 1,2, 3. If the (2n — 1)-dimensional spaces ¢1, ¢2,¢3 have a (n — 1)- 
dimensional space in common, then we say that {71,72,73} and {71, T2, 73} 
are in perspective; if ¢1,¢2,¢3 have a nonempty intersection, then we say 
that {71,72,73} and {7, 72,73} are in semi-perspective. 

Let O be a regular generalized oval in PG(3n — 1,q), with q odd. Then 
for any three distinct elements 7;,7;,7, of O the sets {7;,7;,7,} and 
{T;,Tj;,T} are in perspective, where 7 is the tangent space of O at 7, 
l € {i,j,k}. This follows immediately from the fact that in the odd case, 
by the Lemma of Tangents (that is, Lemma 14.1), this property holds for 
every oval. In the even case, by Theorem 14.12(i), the triples {7;,7;, 7} 
and {7;,7;,7%} are in perspective for any three distinct elements 7;,7;, 7% 
of any generalized oval O (here (, 9 G2 1 G3 is the nucleus of O). 

Now we state an interesting property on quadrics in PG(38n—1,q),n > 1. 
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Theorem 14.32. Let Q be a nonsingular quadric in PG(3n — 1,q), q odd, 
and let 71, 72,73 be distinct (n—1)-dimensional spaces on Q which generate 
PG(3n — 1,q). The tangent space of Q at 7; is denoted by 7;, with i = 
1,2,3. Further, assume that 7,07; = @, for alli # j. Let me = 1497; 
wiht. ter = (lob) ond let G =< hay S; wot t = 152.8) Ther. the 
dimension of (4, N¢2N¢3 has the same parity as n —1. In particular, if n 
is odd, then {71,772,773} and {T1, 72,73} are always in semi-perspective. 


Remark 14.10. Since we are mainly interested in generalized ovals we 
assume in Theorem 14.32 that q is odd. In the even case the statement 
is quite different, so for example if g is even and n = 2 we always have 


GN@N¢gs =. 


Let q be odd. Then for each n even any classical generalized oval of 
PG(3n — 1,q) belongs to a nonsingular elliptic quadric Q~ (3n — 1,q) and 
to a nonsingular hyperbolic quadric Q*(3n—1,q), and for each n odd, any 
classical generalized oval of PG(3n—1, q) belongs to a nonsingular parabolic 
quadric, see [31]. In each of these cases the tangent space at any element 7 
of the generalized oval coincides with the tangent space at a of any of the 
corresponding quadrics. For n = 2 any classical generalized oval O is the 
intersection of some Q~ (5, q) and some Qt (5, q); it is contained in (¢+1)/2 
nonsingular elliptic quadrics and (q+ 1)/2 nonsingular hyperbolic quadrics, 
and at each element L of O these quadrics have a common tangent space 
which coincides with the tangent space of O at L. Shult and Thas ((30]) 
prove that if O is a generalized oval of lines contained in a nonsingular 
hyperbolic quadric Q* (5, q) of PG(5, q), g odd, then O is classical. 

As for q even all tangent spaces of a generalized oval contain a common 
(n—1)-dimensional space, it follows that for n > 1 and q even, a generalized 
oval is never contained in a nonsingular quadric. 

In recent years there was great interest in generalized ovals consisting 
of q? +1 lines of Q~(5,q), with q odd. Such a generalized oval is equivalent 
to a set of gq? + 1 points of the nonsingular Hermitian variety H(3,q7) in 
PG(3,q"), such that the plane defined by any three distinct points of this 
set is nontangent to H(3,q?). This object was studied in different contexts 
in [29], [11] and [10]. 


Theorem 14.33. Let O be a generalized oval of PG(3n—1,q), with q odd, 
contained in a nonsingular quadric Q of PG(3n — 1,q). If 7 € O, then the 
tangent spaces at 7 of O and Q coincide. 


Corollary 14.4. Let O be a generalized oval of PG(3n — 1,q), with q odd, 
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contained in a nonsingular quadric Q of PG(3n — 1,q). Then any three 
distinct elements 71, 72,73 of O satisfy the requirements in the statement 
of Theorem 14.82. 


Next we state an elegant characterization of pseudo-conics, generalizing 
a characterization of conics from Section 14.1.4 (Theorem 14.2, relying on 
the geometric interpretation of Lemma 14.1). 


Theorem 14.34. Assume that O = {70,71,...,%qn} ts a pseudo-oval of 
PG(3n — 1,q), q odd, and let 7; be the tangent space of O at 7;, with i = 
0,1,...,q”. If for any three distinct i,j,k with i,j,k € {0,1,...,q"} the 
triples {m;,7;,7} and {7;,7;,Te} are in perspective, then O is a pseudo- 
conic; clearly the converse also holds. 


We will now give the formulation of Theorem 14.34 in terms of gener- 
alized quadrangles. 


Theorem 14.35. Let S be a TGQ of order s, s odd and s #1, with base- 
point x. Further, let Ly, L2,L3 be distinct lines incident with x and let y; 
IL;,,2#y;, with i = 1,2,3. Then S is isomorphic to the GQ Q(4,s) if 
and only if there are s points 21, Z2,..., 25 not collinear with x such that the 


line Mj; incident with z; and concurrent with xy; contains a point collinear 
with y, and yj, with 17, k,l} ={1, 2,3} and t= 1,2,...455. 


There is also an interesting interpretation of Theorem 14.34 in terms of 
Laguerre planes; see [38]. 
Now we will state another property on quadrics in PG(3n — 1,q). 


Theorem 14.36. Let 7,71, 72 be mutually skew (n — 1)-dimensional sub- 
spaces of PG(3n—1,q), and let 7; be a (2n—1)-dimensional space containing 
1; but skew to 1; and my, with {i,j,k} = {0,1,2}. Coordinates are chosen 
wn such a way that 19 =< C0, Cis x% s9@n—1 Sy Wi << Ons Citys, Con—1 Ss 
T2 =< €2n, €2n+1,---,€3n—1 >, where e; has coordinate 1 in positioni+ 1, 
and zero elsewhere. Further, let 


Igy Kay, 
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Xo Xan 
m2 68 as ; = 0, 
Xn-1 X3n-1 
Xo X, 
72% : +7 = 0); 
Xn-1 Xon-1 


for certain n x n matrices a,3,y (then det(aBy) #4 0). Then there is a 
quadric containing 7o,71,72 and having 7; as tangent space at 7;, with 
i =0,1,2, if and only if the matrix equation 


Z6 = Z" , with 6 = ay, 


has a nonsingular solution for then x n matrix Z. Moreover the quadric 
is nonsingular if and only if 0+ ¢€ is nonsingular, that is, if and only if 
T1719 72 = (here € is the identity matrix). 


Corollary 14.5. For n = 2 such a quadric exists if and only if det(@) = 1 
with either Tr(@) 4 2 or 0 =e, that is, with either det(@—¢) #0 or 0 =e. 


Remark 14.11. The condition 0 = ¢€ is equivalent for {70,71,72} and 
{T0, 71,72} to be in perspective; det(@ — <)det(@ + ¢) = 0 is equivalent for 
{mo, 71,72} and {70, 71, 72} to be in semi-perspective. 


Another interesting result towards the classification of all pseudo-ovals 
of PG(5,q), with q odd, is the following result. 


Theorem 14.37. Let O = {Lo,Ii,...,Lq2} be a generalized oval in 
PG(5,q), with q odd, and let 7; be the tangent space of O at L;, with 
i = 0,1,...,q%. Then for any three distinct i,j,k in {0,1,...,q7} there 
is either a quadric containing L;,L;,L;, and having {7;,7;,Th} as tangent 
spaces at respectively L;,L;, Ly, or {Li, Lj, Ly} and {7;,7;, 7%} are in semi- 
perspective but not in perspective. 


Remark 14.12. In order to prove Theorem 14.37 a new type of “Lemma 
of Tangents” was developed. 
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Chapter 15 


Super Geometry as the Basis for Super Symmetry 


V. S. Varadarajan 
University of California, 
Los Angeles, USA 
vsv@math.ucla. edu 


15.1. Introduction 


Mathematicians who have an interest in physics know that a new and ex- 
tremely powerful collider is being readied at CERN, the European Center 
for Nuclear Research, in Geneva, Switzerland. It is an order of magnitude 
more powerful than Fermilab, the host of the currently most powerful such 
machine. It is expected that the new collider will make fundamental dis- 
coveries about the nature of physics in ultra-small distances and times, 
thereby revealing a lot about the structure of elementary particles. Among 
the questions that may be answered by the experiments with the new ma- 
chine are the existence of the super particles which have been predicted by 
what the theoretical physicists call super symmetry. I say may in the above 
because it is not known at what energy level in the past super symmetry 
was broken, and so one cannot be sure that the collider would reveal the 
presence of super symmetry. Nevertheless, because of its beauty, it would 
be surprising if Nature has not made use of super symmetry (to modify a 
famous remark of Dirac about monopoles). 
For the mathematician there are two basic questions. 


(i) What is super symmetry and what has it to do with symmetry, one of 
the oldest parts of mathematics? 

(ii) How does the presence of super symmetry lead to the existence of super 
partners of the known elementary particles? 


In this short article I shall attempt to examine these two questions briefly. 
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There is nothing that is new in what I am going to say, and all of it is known 
to experts. But the general mathematician may not be aware of some of 
the things I shall talk about, and this article is intended for such a person. 
Almost all of what I shall say is based on joint work with Gianni Cassinelli, 
Alessandro Toigo and Claudio Carmeli of the theoretical physics group at 
the University of Genoa and INFN, Genoa, Italy. 

It is very well known [1, 2] that the notion and uses of symmetry go back 
to very ancient times and are tied up with concepts of group theory. In the 
most primitive setting we have a group G acting on a set X, and one views 
the actions of the elements of the group as “symmetries” of X or symmetries 
of the system whose states are described by the points of X. If G is a Lie 
group one can often replace the action of G by the infinitesimal action of 
g, the Lie algebra of G. Sometimes we only have the action of g; this is 
the case when the Lie algebra is infinite dimensional and the corresponding 
group is more ambiguous to define because of topological difficulties, as for 
example, when g is an affine Kac-moody Lie algebra or a Virasoro algebra. 

In classical physics, and even in quantum mechanics, there is no neces- 
sity to question the use of flat Minkowskian geometry for spacetime, since 
gravitational forces are negligible in that scale. It is only when experiments 
began to probe extremely small distances that theories trying to understand 
and predict the experiments began to encounter serious conceptual difficul- 
ties. Physicists then began to look more closely into the structure of space- 
time at ultrashort scales of distances and times. The Planck scale refers to 
distances of the order of 10~°%cem and times of the order of 1074% sec. At the 
Planck scale, no measurements are possible and so conventional models can 
no longer be relied upon to furnish a true description of phenomena. String 
theory attempts to work in a framework where the smallest objects are not 
point-like but extended, i.e., strings or (more recently) membranes. Space- 
time geometry at the Planck scale is thus almost surely non-commutative 
because there are no points. No one has so far constructed a convincing 
geometrical theory which is noncommutative at the Planck scale but has 
the Riemann-Einstein geometry as a limit when the scale becomes the usual 
one. 

Even at energies very much lower than the Planck scale, a better un- 
derstanding of phenomena is obtained if we assume that the geometry of 
spacetime is described locally by a set of coordinates consisting of the usual 
ones supplemented by a set of anticommuting (Grassmann) coordinates. 
Why Grassmann coordinates? The Grassman coordinates model the Pauli 
exclusion principle for the Fermions in an embryonic form. The theory of 
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classical fields on such a manifold would then provide a basis for quanti- 
zation that will yield the exterior algebras characteristic of quantum de- 
scriptions of Fermionic states. Such a manifold is nowadays called a super 
manifold. The symmetries of super manifolds are super Lie groups. 

It must be noted that the original super symmetries discovered by the 
physicists were infinitesimal and acted on matter fields which, because of 
the Dirac equation, were complex Fermiominc fields. If A, B are two such 
symmetries, the anticommutator [A,B], := AB + BA is also one. This 
is very different from the usual infinitesimal symmetries governed by Lie 
algebras, where, for given A, B, the usual commutator [A, B] = AB — BA 
is also an infinitesimal symmetry. The physicists’ examples are actually 
super Lie algebras and the whole theory acquired a beautiful shape when 
Kac [3] introduced the notion of a super Lie algebra and classified the 
simple ones. However the theory of these symmetries still lacked a geometric 
underpinning till Salam, Strathdee, and others came up with the beautiful 
idea that it is essential to generalize the notion of a classical manifold 
to include Grassmann coordinates in order to have the proper geometric 
foundation for the super symmetries [4, 5}. 

As I shall explain below, the physicists’ treatment of super symmetry 
and its consequences can be fitted into a rigorous framework of super man- 
ifolds, super Lie groups, and their actions on super manifolds. The result is 
a beautiful extension of classical geometry with a possibility of applications 
to the most fundamental parts of the physical world. For some physicists’ 
views see [6]. For a survey see [7]. 


15.2. Super Geometry 


Normally, manifolds, groups, and their actions are described by first describ- 
ing them at the set-theoretical level and then adding the structural features 
(smoothness etc). In super geometry also this is the way things are done; 
but to achieve this geometrically intuitive picture it is necessary to take 
into account the presence of the odd coordinates (Grassmann variables) in 
an essential manner. The basic language to describe super manifolds is that 
of ringed spaces, namely, a topological space with a sheaf of rings on it. To 
encode the Fermionic structure of matter the rings of the sheaf are required 
to be non commutative, more specifically, super commutative. This means 
that they are Zo-graded and satisfy the super commutativity axiom: 


ab = (— 1)P)P) bq 
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where p is the parity function: p(x) is 0 for x even and 1 for x odd. The 
standard examples are R?!? = (R?, OP!2) where the space is R? and for any 
open set U, the ring of the sheaf OP!4 is C®(U) @A(E1,.--, q), A(Ex, -- «5 €q) 
being the Grassman algebra in the indeterminates £),...,€ . A general su- 
per manifold is a ringed space (X,O) which is locally isomorphic to R?!7. 
The odd local coordinates €; cannot be observed but their presence in the 
fundamental sheaf alters the consequences of the theory. This is like the 
Grothendieck notion of a scheme in algebraic geometry where the structure 
sheaf contains nilpotents which are not observable but which have funda- 
mental geometrical consequences. By equating all odd coordinates to 0, we 
obtain from a super manifold M a classical smooth manifold Mo, which 
can be realized as a submanifold of M. M may thus be interpreted as a 
classical manifold Mo together with a Grassmann cloud over it. 

The super manifolds form a category in a natural manner. Of course 
I have only spoken about real C® super manifolds but one can define 
in a similar fashion the real or complex analytic super manifolds also. A 
super Lie group is a group object in the category of super manifolds. More 
precisely, in order to endow a super manifold G with the structure of a 
super Lie group, we must specify two morphisms 


piGxG—GG, t:G—G 


which are the multiplication and inverse maps, satisfying the appropriate 
axioms for associativity. In order to speak of symmetries of super manifolds 
we must introduce super group actions on super manifolds. In principle this 
is easy: a super Lie group G acts on a super manifold M if there is an action 
morphism 


a:GxM—M 


satisfying the appropriate associativity constraints expressed in terms of 
the maps a, [, L. 

The super manifolds are not easy to work with because of the involved 
nature of their definition which makes geometric motivation very hard to 
come by. In order to overcome this difficulty one follows the path laid out by 
Grothendieck in algebraic geometry, namely, the language of the functor of 
points. If we have a category C and JM is an object in it, one has associated 
to it the contravariant functor T —> M(T) where M(T) = Hom(T, ™). 
Elements of M/(T) are called the T-points of M. If M1, M2 are two objects in 
C, any morphism M, —> Mo gives rise to a natural map M1(T) — M2(T) 
between their respective functors of points. It is a simple but fundamental 
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result ( Yoneda’s lemma) that any natural map M\(T) —> Mo(T) arises in 
this way from a unique morphism WM, —> Mo. In this manner M can be 
entirely replaced by the functor T —> M(T). The natural question is of 
course to ask which contravariant functors from C to the category of sets 
arise as the functors of points of objects of C. Such a functor is called repre- 
sentable. In practice the basic questions always come down to showing that 
certain functors constructed naturally from given representable functors, 
are representable. 

This language restores the geometric intuition completely. For instance, 
if M;(i = 1,2,...,n) are super manifolds, the functor 


T — M,(T) x M2(T) x --- x M,(T) 


is representable and the object representing it is the product super manifold 
M, x Mz x---x M,. Super Lie groups correspond to representable functors 
from the category of super manifolds to the category of groups. Thus, if 
G is a super Lie group, G(T) is a group for every T and T —> G(T) is 
a functor into the category of groups. It is important to realize that for 
any fixed T, G(T) has no structure other than that of a group, and that the 
super Lie group structure of G is to be recovered from the entire assignment 
T — G(T) and its functorial nature. The classical part Go of a super Lie 
group G is a classical Lie group. It is a general principle that all topological 
issues regarding a super Lie group G are always resolvable in terms of its 
classical part Go. For a super Lie action of G on M the description in terms 
of functor of points is equally intuitive: the action is equivalent to an action 
G(T) x M(T) —> M(T) in the usual sense for each T,, functorial in T. 
Just as in the classical case one can associate super Lie algebras to 
super Lie groups. The assignment G —> Lie(G) that takes a super Lie 
group to its (super) Lie algebra is functorial and as in the classical case 
captures the group structure in essence. Because of its linearity, the super 
Lie algebra is a much easier object to handle than the corresponding group, 
and physicists invariably work with the super Lie algebra. However what 
captures the complete information is the so-called super Harish-Chandra 
pair (Go,g) associated to the super Lie group G: here Go is the classical 
Lie group underlying G, g = Lie(G), and the two objects are interlocked by 
the action of Go on g. It is a fundamental theorem that the assignment 


G— (Go, g) 


is functorial and is an equivalence of categories from the category of super 
Lie groups to the category of super Harish-Chandra pairs. 
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15.3. Super Symmetric Extensions of Relativistic Theories 
and Super Poincaré Groups 


The relativistic invariance of the description of a physical system is usually 
guaranteed by giving an action of the Poincaré group on the space of states 
of the physical system. In order to obtain a super symmetric extension of 
such a theory it is first of all necessary to enlarge the Poincaré group to a su- 
per Lie group whose classical part is the Poincaré group. Such enlargements 
are called super Poincaré groups and I shall discuss their construction now. 
Let me first of all describe how Lie algebras can be enlarged to super 
Lie algebras. If g is a super Lie algebra with even and odd parts go, g1, 
then we have the following (to be pedantic, the ground field should have 
characteristic different from 2 and 3; here we may take it to be R or C). 


(i) go is a Lie algebra and gj is a go-module 
(ii) The super bracket [-,-] is a symmetric bilinear map of g1 x gi into go 
which is go-equivariant 
(iii) For all X € gi, we have [X, |X, X]] =0. 


Only the condition (iii) is non-linear and so requires care in handling. Let 
t be an ideal of go and let us assume that t acts trivially on g; and [-,-| 
in (ii) maps into t. Then (iii) is automatic. Although this is the preferred 
mode of construction of super Lie algebras in the physics literature, there 
are other methods and examples. 

In the physical situations the Lie algebra go is a Poincaré Lie algebra. By 
this we mean that go = Lie(ISO(V)°)) where ISO(V)° is the inhomogeneous 
Lorentz group attached to V, namely, the semidirect product V x’ SO(V)° 
where the superindex 0 denotes the connected component of the identity. 
Clearly go is the semi direct sum V x’ Lie(SO(V)), and any module for 
Lie(SO(V)) can be viewed as a module for go in which V acts trivially. The 
physical requirements need the following. 


(iv) If the ground field is R and go = V x’ Lie(SO(V)) where V is a 
real vector space with a non-degenerate quadratic form (the metric) 
of Minkowski signature, then g;, as a gg-module, is spinorial. 


The meaning of spinoriality for a go-module is as follows. Briefly, a 
module for go = V x’ Lie(SO(V)) is spinorial if V acts trivially and as a 
Lie(SO(V ))-module it splits over C as a direct sum of spin modules. Let me 
recall what the spin modules are. The orthogonal Lie algebras are simple 
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(except when dim(V) = 4 when it is the direct sum of 2 orthogonal Lie 
algebras in dimension 3) and their irreducible modules are generated in a 
natural sense by the fundamental modules whose highest weights are delta 
functions on the Dynkin diagram. Thus there is one to each node of the 
diagram and the ones corresponding to the right extreme nodes are the spin 
modules. There are 2 of them when V is even dimensional and 1 when V 
is odd dimensional. If Spin(V) is the two-fold cover of SO(V)°, then these 
modules can also be described as the irreducible modules of go of minimal 
dimension which do not come down to modules for SO(V)° (they always 
lift to Spin(V)). 

The super Poincaré algebras and the associated super Poincaré groups 
are defined as follows. We start with 


do = V x’ Lie(SO(V)°), Go =V x’ Spin(V) 


and select a module S$ for Lie(SO(V)°) which is viewed as a module for go 
in which V acts trivially. Then g := go @S is a super Poincaré algebra, and 
(Go, g) is the corresponding super Poincaré group if: 


(i) Go acts on gi = S spinorially. 
(ii) The odd bracket g; x gi —> to maps into the closure of the open for- 
ward light come (minus the origin) (positivity of energy). 


Theorem 15.1. Given any spinorial module S there is a super Poincaré 
group (Go, g) with go = Lie(Go),g1 = S. The bracket on gi is projectively 
unique if S is irreducible. 


Given V it is thus clear that there are many super Poincaré algebras 
with classical part equal to V x’ Lie(SO(V)). If S is irreducible this super 
Poincaré algebra is essentially unique. If r is the number of irreducible 
constituents of S, physicists speak of N = r super symmetry. The case 
when V has the Minkowski signature is most important for applications 
but one can classify the modules S for all signatures. The simplest special 
case arises when dim(V) = 4 and V has Minkowski signature (+ — ——). 
The group Spin(V) is then SL(2,C)R where the suffix R means that the 
group is viewed as a group over R. There are now 2 spin modules, namely 
the two dimensional representation 2 and its complex conjugate 2. The 
representation 2 @ 2 is then real and we write M for it, so that Mc := 
C @R M = 2622. The physicists call M the Majorana spinor. Any real 
spinorial module for SL(2,C)g is a direct sum of copies of M and so to 
get the smallest super Lie algebra extending go we must take g; to be M. 
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To see that we have a projectively unique symmetric map M ® M — V 
which is equivariant with respect to the spin group, it is easy to reduce it 
to the complex case; there it follows from the decomposition 


(Mc ® Mc)*"™™" =306304 


where 3 is the complex 3-dimensional irreducible representation, and 4 is 
the unique 4-dimensional irreducible which descends to SO(V)°. 

This super Lie algebra was the first to be constructed historically. In or- 
der to determine all the super Lie algebras extending the Poincaré algebra 
in arbitrary dimension and Minkowski signature one must have complete 
information on the reality of spin modules and the circumstances when 
they admit symmetric invariant forms. It turns out that this can be done 
and so one has a description of all super Poincaré algebras [8] (hence the 
corresponding super Lie groups). For a general discussion of spinor calculus 
see [8, 9]. The reference [9] is a profound tour de force of everything math- 
ematical that one wants to know about super symmetry (and more!). See 
also 6c, 6d, 6e, 6f, 6g, 6h. 


15.4. Classification of Super Particles 


Let P be a Poincaré group or better, its two-fold cover V x'Spin(V), and let 
G be asuper Lie group whose classical part is P: Go = P. The first question 
is to determine the free elementary particles which admit super symmetry 
and to describe the states of such particles. Without super symmetry the 
classification of elementary particles is through their mass and spin. The 
Hilbert space of states of the particles must carry a representation of P 
and elementarity requires the representation to be irreducible. Thus the 
free elementary particles are in bijection with UIR’s (unitary irreducible 
representations) of P. However not all UIR’s define physically realizable 
particles: the mass must be real or the energy must be > 0. Mathematically 
it is a question of determining the UIR’s of a semi direct product P = V x’ L 
where V is a real vector space (spacetime) and L is a closed subgroup of 
GL(V). For determining the UIR’s of such groups we have available the 
Mackey-Wigner [16, 17] method of little groups. I shall now explain briefly 
this method. The group L acts on V and so, by duality, on the dual V* (the 
momentum space). We can then speak of the orbits in V* for this action of 
L. Fix a point p € V*. The stabilizer L, of p in L is called the little group 
at p. Then to each UIR a of Ly one can associate a UIR of P in a canonical 
manner which we denote by U”:?. The unitary equivalence class of U?’? 
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does not change if we change p to some other point in the orbit of p. The 
Hilbert space of o is the space of spin states of the particle (represented by 
the UIR U”:’) for the given momentum p. The orbit of p and the unitary 
equivalence class of o are the parameters of the UIR of P thus defined. 
In the case when V is a quadratic vector space with Minkowski signature 
and L is Spin(V), the orbits are given essentially as the level sets of the 
dual quadratic form on V* and so represent sets of constant energy. In 
pathological cases the group P may have additional UIR’s than the U”’’. 
However when the action of L on V* is regular in a certain sense, there 
are no other UIR’s of P except for the U”’?. Regularity means that we 
can find a Borel set in V* meeting each orbit exactly once. It is a theorem 
of Effros [18] that this happens if and only if each orbit is locally closed, 
i.e., is open in its closure. The Minkowskian vector spaces are regular in 
this sense and thus we obtain the classification of elementary particles in 
terms of mass and spin. Notice that we have to rule out the negative energy 
particles by fiat. 

The super Poincaré groups are semi direct products also. The subspace 
a := V @g; is a sub super Lie algebra of g since the bracket gi x gj 
goes into V. It is often called a super translation algebra. It is easy to 
construct explicitly the corresponding super Lie group which we call A so 
that Ag = V. The action of the spin group Spin(V) on a lifts to an action 
on A and allows us to form the semi direct product A x’ Spin(V) which we 
denote by G. The question now is the classification of the UIR’s of G. 

This was essentially done by the physicists {[19, 20] in the 1970’s. They 
assumed that the Mackey-Wigner method of little groups would extend to 
the super symmetric context and determined the little groups and the space 
of states for the little groups for a given momentum vector (on which the 
little groups act). There were several novel features in the situation that 
were not present in the theory when no super symmetry is assumed. 


(i) Only the orbits with energy > 0 give rise to UIR’s. Thus super sym- 
metry makes energy positive. 

(ii) For a given momentum p let H, be the space of states of fixed spin; 
the new space of states at p turns out to be H, ® W, where W, is 
an irreducible representation of a certain Clifford algebra canonically 
associated to p. 

(iii) A super particle, when viewed as a quantum system for the usual 
Poincaré group, splits as a direct sum of particles in the usual sense, 
forming a multiplet; the multiplet contains pairs of particles with same 
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mass but opposite parity of spin (super partners), leading to one of the 
most striking consequences of super symmetry, namely, the existence 
of super partners. (In practice, because of super symmetry breaking, 
the partners acquire mass.) 


How are we to understand these results on a rigorous basis? Clearly what 
one wants is an extension of Mackey’s classical theory of UIR’s for regular 
semi direct products to the super symmetric case. I shall devote the next 
section to a discussion of this approach [21]. 


15.5. UIR’s of Regular Super Semi Direct Products 


We begin with some remarks on super Hilbert spaces. All sesquilinear forms 
are linear in the first argument and conjugate linear in the second. A super 
Hilbert space is a super vector space H = Ho ® H, over C with a scalar 
product (- , -) such that H is a Hilbert space under (- , -), and H;(z = 0, 1) 
are mutually orthogonal closed linear subspaces. If we define 


0 if x and y are of opposite parity 
(x,y) = ¢ (x,y) if a and y are even 


i(x,y) if # and y are odd, 


then (x,y) is an even super Hermitian form with 
(y, x) = (—1)?™)P (x,y), (x, x) > O(a 4 0 even) 
and 
i" (x, 2) > 0(2 4 0 odd). 


If T(H — H) is a bounded linear operator, we denote by T* its Hilbert 
space adjoint and by T" its super adjoint given by 


(Ta, y) = (-1POP@ (a, Thy). 


Clearly T? is bounded, p(T) = p(T"), and T? = T* or —iT* according as T 
is even or odd. For unbounded T we define T* in terms of T* by the above 
formula. These definitions are equally consistent if we use —7 in place of 7. 
But our convention is as above. 

The first problem to understand is how to define the concept of a unitary 
representation (UR) of a super Lie group. The notion of a representation p 
of a super Lie algebra g is quite clear. However if X in an odd element of 
g, then [X, X] must go over under p to p(X)?; but as [X, X] € go, it will in 
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general be represented by an unbounded operator in the Hilbert space of the 
representation, and so it becomes clear that X itself must be represented 
by an unbounded operator. This means that domain considerations enter 
the picture and so we must exercise great care in defining what we mean by 
a UR of a super Lie group. Initially we shall suppose we are given a UR 7 
of the classical group Gp in a super Hilbert space H and that the operators 
p(X)(X © gi) are all defined on C™(7), the space of differentiable vectors 
for m in H. We require the following properties. 


(i) mo is an even unitary representation of Go. 

(ii) p is a linear map of gi into the subspace of odd endomorphisms of 
C (m9). Here C™(m0) is the space of differentiable vectors for 7 (it 
is super linear, i.e., it contains the odd and even components of each 
of its elements). 

(iii) p satisfies the requirements below: 


(a) p(goX) = 0(go)p(X)m0(g0)~* (X € 91,90 € Go) (compatibility 
of p with 70). 

(b) p(X) with domain C'* (79) is symmetric for all X € gi. This means 
that the adjoint p(X)* is an extension of p(X) for X € gi. 

(c) —idmo([X, Y]) = p(X)p(Y) + p(Y )o(X)(X,¥ € gi) on C™ (719). 


It then turns out that for any X € g the operator p(X) with domain 
C™(m9) is essentially self adjoint, and, with ¢ = e~**/4, 


m: Xo + X1 +> dro(Xo) + C7" (X1)(Xo € go, X1 € 91) 


is a super representation of the super Lie algebra g in C™(29) compatible 
with 7. 


The choice of C™(79) as a domain of definition for the p(X)(X € gi), 
although natural, is certainly arbitrary; for instance we could have chosen 
the space of analytic vectors. But fortunately this does not create problems. 
If we use any other dense domain with some natural requirements, all the 
p(X)(X € gi) will have closures which are defined on C™ (79) and we have 
the earlier situation. Thus we have an essentially unique way to define the 
UR of a super Harish-Chandra pair. 

The classical Mackey theory is based on two themes. The first is an 
analysis of transitive systems of imprimitivity and the second is an applica- 
tion to the UIR’s of regular semi direct products. I shall now describe the 
generalizations to the super symmetric case. 
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The sub super Lie group H = (Ho, 4) of G = (Go, g) is special if hy = gi. 
The quotient super manifold Q = G/H ~ Go/Hpo is then purely even. 
A super system of imprimitivity based on Q is a system (7,7, P) where 
(7, p™) is a UR of (Go, g), P an even projection valued measure on 2 such 
that (7, P) is a classical system of imprimitivity and the projections of P 
commute with the spectral projections of p(X) for all X € gi (P © p”). 

The super imprimitivity theorem. There is a natural equivalence 
of categories from UR’s of (Ho,6) to super systems of imprimitivity on Q. 

The super Harish-Chandra pair(Go, g) is a super semidirect product if: 


(i) Go = To x’ Lo, To a real finite dimensional vector space, Lo C SL(To) 
a closed subgroup 
(ii) Tp acts trivially on gi and [gi, 91] C to := Lie(Zo) 


We can now describe the little group theory in the super symmetric 
case. For \ € T¥, S* = (GQ, g*) is the little (super) group at A: GA = ToL A 
and g* = tp ® G ® gi. It is a special sub super Lie group. For a UR (7, p”) 
of G = (Go,g), P is the spectral measure of TO Since To acts trivially 
on g1, the m(t) commute with the p(X). If the UR is irreducible (UIR), 
then P is concentrated on an orbit. If A € Tj, a UR of G is A-admissible 
if r(t) = e* T(t € Ty). X itself is admissible if there is a \-admissible UR 
(<=> if there is a A-admissible UIR). 


Ty = {A | €TG,A admissible}. 


Theorem 15.2. For any  € TS , the super imprimitivity theorem gives 
an equivalence of categories from the category of -admissible UR’s of S* 
with the UR’s of (Go,g) whose spectra are contained in the orbit of X. 
In particular, a UIR has spectrum in the orbit of X if and only if X is 
admissible, and then we have a bijection between the sets of equivalence 


classes of UIR’s of G and S%. 


Remark 15.1. Note the significant difference from the classical one in that 
there is a selection rule for the orbits: admissibility. 
Let \ be admissible and (0, p”) be a \-admissible UIR for $*. Then 


(i) Qy(X) = (1/2)A([X, X]) is a LA-invariant quadratic form on gi 
(ii) p?(X)? = Q)(X)I on C*(c) 


It follows, as p’(X) is essentially self adjoint on C™(c), that 


(1) Q) is nonnegative and the p? are bounded. 
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Theorem 15.3. Let \ € Tj. Then the following are equivalent. 


(i) X is admissible 
(ii) Q(X) > 0 for all X € gy. 


If G is a super Poincaré group, condition (ii) is essentially the condition 
that energy is positive. Hence we refer to (ii) as the positive energy condi- 
tion. 


Let C) be the algebra generated by g; with the relations 
x? = Qy\(X)1(X E g1)- 


Even though Q) may have a nonzero radical we call C, the Clifford algebra 
of (g1,Qy). If 


91d = gi/rad Q) 
then Q) is strictly positive on g;, and there is a natural map 
C, — Cy = Clifford algebra of gi) 


with kernel as the ideal generated by the radical of Q). 
One can build a UIR (a, p) of the little group $* with 


(i) p a representation of C, by bounded operators, p(X) self adjoint and 
odd for all X € gi; p is called a self adjoint representation. 
(ii) o is an even UR of LA such that 


o(t)p(X)o(t)"' = p(tX) (te Lo,X € gi) 


We shall assume for simplicity that LA is simply connected. Since Q) is 
L-invariant we have a map 


Ly — SO(g1a) 
which lifts to a map 
Ip — Spin(giy). 


There is an irreducible self adjoint representation 7, of Cy, finite dimen- 
sional, unique if dim(g,,) is odd, unique up to parity reversal otherwise. 
The spin representation of Spin(gi,) lifts to an even UR « of LA, with 


Ky(t)T)(X)K(t)7* = (EA) (£€ L3,X € gi). 
The assignment 


ri— +> 0,» =(¢,/), G= EP Shy, p=1®87 
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is an equivalence of categories from the category of purely even UR’s r of 

I to the category of \-admissible UR’s of the little super group S%. It 

gives a bijection (up to equivalence) between UIR’s of L@ and UIR’s of $?. 
When the little group is only connected but is of the form 


[LA=Ax'T (A simply connected , 7 a torus) 
Then the development is similar but a little more involved. The theory now 
gives a bijection 
Tr <—— Or) — Or 


between UIR’s r of Le and UIR’s 0, of G with spectrum in the orbit of 
XA. The ©, represent the super particles. The corresponding UR’s of Go 
are not irreducible and their irreducible constituents define the so-called 
super multiplets. The members of the multiplet are the ordinary particles 
that correspond to the orbit of A and the irreducible constituents of r @ kK}. 
When r is the trivial representation we obtain the fundamental multiplet. 
They are the ordinary particles defined by the orbit of \ and the irreducible 
constituents of &). In the case of super Poincaré groups & can be explicitly 
determined and its decomposition into irreducibles described (in principle). 
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